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(continued from front flap) 


Both ALGOL and COBOL automatic 
programming languages are treated thor- 
oughly and lucidly. ALGOL is typical of 
other less general algebraic languages 
such as FORTRAN developed by various 
computer manufacturers. COBOL exem- 
plifies business-oriented, data-processing, 
automatic-programming languages. These 
chapters are noteworthy in that they pre- 
sent for the first time in book form these 
internationally accepted automatic pro- 
gramming systems. 


Also worthy of special attention are the 
chapters on programming for special pur- 
pose computers and programming to 
achieve intelligence. The latter chapter 
contains a description of automatic pro- 
gramming language translation; heuristic 
programming; programming to create 
music compositions; and programming to 
aid medical diagnosis. 


PROGRAMMING AND UTILIZING 
DIGITAL COMPUTERS will help fill 
the need for a modern, up-to-date college- 
level programming text. It contains ex- 
tensive automatic programming methods 
and other material of current interest in 
the programming field. Over one-third of 
the book is devoted to automatic pro- 
gramming alone. 


Note: Electronic News in its August 24, 
1961, edition reported that all computers 
purchased by the Department of Defense 
after January, 1962, must have a work- 
ing COBOL and ALGOL system. Since 
programmers will want to be up to date 
on these required automatic programs, 
the new Ledley book will furnish them 
with complete coverage of the systems. 
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DIGITAL COMPUTER AND CONTROL ENGINEERING 


By Roneny 8, Lupiny. MeGraw-Hill Series in Electrical and Electronic 
Engineering. 835 pages, $14.50 


A pioneer in the field of digital computers and digital controls. This book is 
a comprehensive elementary engineering textbook for advanced courses. Its 
unified treatment covers almost all aspects of the field from a detailed engi- 
neering point of view. The subjects covered were chosen to present a con- 
tinuous, natural development of the major areas of digital computers and 
their controls. 


PROGRAMMING AND CODING FOR AUTOMATIC DIGITAL COM- 
PUTERS 


By G. W. Evans II, Stanford Research Institute; and C. L. Perry, Uni- 
versity of California, La Jolla. McGraw-Hill Series in Information Proc- 
essing and Computers. 264 pages, $9.50 


Techniques, methods, and facts essential to programming digital computers 
efficiently are brought together in this practical book. It outlines systematic 
procedures that enhance programming and coding skills and explains the 
computer’s basic design so you can use it effectively. The book describes how 
the computer installation is established, organized, and operated for maximum 
performance. 


COMPUTER HANDBOOK 


By Harry D. Husky and Grantno A. Korn, University of Arizona. 
1288 pages, $25.00 


Gives detailed technical information and a wealth of industrial know-how 
on modern design practices in the field of analog-computer components and 
systems. Included are general-purpose analog computers and representative 
special-purpose machines. Many actual circuit diagrams are included as con- 
crete examples of design principles or for direct adaptation to the designer’s 
problems. 


INTRODUCTION TO ELECTRONIC DATA PROCESSING EQUIP- 
MENT—Its Operation and Control 


By Ropert V. OaxkForp, Stanford University. McGraw-Hill Series in 
Information Processing and Computers. 352 pages, $10.00 


Emphasizing tman-machine communicatiofis, this book explains how to com- 
municate with electronic computers and auxiliary punched card equipment 
and discusses the principles of the equipment’s operation. The book discusses 
existing rather than hypothetical equipment and languages. The IBM 650 is 
used us un example in the discussion of computer organization and machine 
language eemeiing, since it is representative, widely used, and yet less 
complex than are larger computers. 
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PREFACE 


Point of View. This book was written as a college text in programming 
digital computers; it can be used at various levels, ranging from sopho- 
more to first-year graduate. The book is intended to fill the great need 
for an up-to-date, comprehensive text to provide an introduction to the 
many aspects of the digital-computer programming field, including auto- 
matic-programming languages as well as machine languages, the pro- 
uramming of special-purpose computers as well as general-purpose com- 
puters, mathematically oriented as well as business-oriented applications, 
and the modern areas of heuristic programming as well as mathematical 
decision programming. Of course, an introductory exposition of a field 
an large and rapidly advancing as this can never hope to treat all subjects 
exhaustively. Full treatment must be left to specialized source books 
and journal articles. Hence each chapter is designed primarily to intro- 
duce the student to certain fundamental concepts and techniques of 
development. 

The book is divided into three parts, which consider respectively 
iiuchine languages, automatic-programming languages, and data-proc- 
ening techniques. I take the position that the use of machine languages 
encompasses the basic ingredients of programming, and hence Part 1 of 
the book is devoted to such languages. In addition to introducing the 
s(udent to digital computers and some of their applications, Part 1 dis- 
uses a variety of instruction types and memory-addressing methods 
‘lilized in currently commercially available computers. However, many 
of the present-day practices and developments in the programming field 
are centered about the concepts of automatic programming. Thus I 
have devoted Part 2 of the book entirely to automatic-programming 
irethods and applications, including an introduction to the algebraic 
language ALGOL (which is akin to FORTRAN in its basic aspects) and 
ihe business-oriented language COBOL, as well as a chapter intended 
io sketch some of the more modern ideas on programming to achieve 
intelligence, 

Throughout the book the method of teaching is by example rather 
(han by generalized exposition. I feel that the student can best grasp 
abstractions from specifics and develop skills through practice. Hence 

vii 
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Part 3 of the book is included primarily as a source of current methods 
and techniques in numerical analysis, logical analysis, and information 
handling, for use by the student to obtain more realistic material for 
problem analysis and programming practice. In addition, much atten- 
tion has been paid to the exercises, of which more than 500 appear in 
the book. Almost every chapter ends with a bibliography, to stimulate 
the student in further reading in the field. 

Outline of the Text. In Part 1 of the text, the first chapter is intended 
to motivate the student by delineating the wide range of applications of 
digital computers, and to orient the student with respect to the basic 
idea of a digital computer. The next two chapters are directly concerned 
with machine-language programming; their purpose is to expose the 
student to a large variety of instruction and operation formats, as well 
as the practice of coding. Sequences of instructions in three-, two-, and 
one-address-system instruction formats are considered, as well as the 
fundamental concepts of the loop, or iteration, and the subroutine. This 
in followed by a discussion of the various kinds of operations that instruc- 
tions can involve, and by consideration of many types of indirect-address- 
ing and relativesindexing memory-access methods. At the end of Part 


1, in order to illustrate clearly the possibility of other than general-pur- 
pone computers, the digital differential analyzer is considered, along with 
realetime control and other special techniques, Also, a general discussion 


is Inblided on the concepts underlying the “super” computers that have 
nowt recently been developed, 

When first being introduced to automatic programming, the topic of 
Part 2, the student almost invariably becomes somewhat preoccupied 
with the question of how the generalized instructions can actually be 
realized in terms of a machine language. Such preoccupation interferes 
with further learning of more complicated automatic-programming 
methods. Hence Part 2 begins with a chapter (Chapter 5) designed to 
offset such problems by describing in a relatively concrete way how simu- 
lation routines, address-assigner routines, algebraic-compiling routines, 
and so forth, can be coded. In addition, the important concepts of 
threaded and push-down lists and their uses are introduced. Hence by 
the time the student reaches Chapter 6 (on ALGOL) he has gained at 
least some intuitive understanding of how automatic-programming lan- 
guages can be coded, and he can then concentrate his attention more 
fully on the details of the languages themselves. 

Chapter 6 is designed as an introduction to ALGOL, and should not 
be considered as a full exposition of that language. I do feel, however, 
that a knowledge of the contents of Chapter 6 will enable the student 
easily to assimilate the actual details of ALGOL, as well as FORTRAN 
and other algebraically oriented languages. Tor pedagogical reasons, | 
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have given a simplified version of ALGOL, stressing the more important 
concepts involved. To enhance the clarity of the exposition, I have 
tuken the liberty of redefining some terms and changing some names (but 
| have tried to point out these changes and simplifications from ALGOL- 
40 in the footnotes). The symbolic metalanguage notation of the 
ALGOL-60 report} is introduced, for I feel that its use can enable a 
nore precise, unambiguous definition of an automatic-programming 
language. Even if at first sight it may have a tendency to appear formi- 
dable to the student, I believe that with a little experience he will find ita 
simple and often invaluable notation. In Chapter 7, I consider the 
husiness-oriented language COBOL, again in a version simplified | for 
pedagogical reasons. In order to avoid ambiguity, I here also utilize 
some of the metalanguage notation of the ALGOL-60 report. Although 
only the basic concepts of COBOL are considered, it is felt that with a 
knowledge of these, the student can easily digest the many detailed 
features of the actual COBOL language and other such data-processing 
languages. Since writing a program in the COBOL language really 
involves the use of an entire automatic-programming system, the chapter 
ends with the presentation of a complete example. 
The title of Chapter 8, the last chapter of Part 2, is intended primarily 
io stimulate the student’s imagination on some of the great potentialities 
that exist in the frontiers of programming research; however, I owe an 
apology to the programming teachers for this title, since quite obviously 
methods for achieving intelligence in computers are not described in the 
chapter. The theme of the chapter is the achievement on a computer of 
some realizations of the concepts of deductive and inductive inference. 
liv inductive inference I mean heuristic programming, specific examples of 
which are presented. In our enthusiasm for the new and unusual, we 
just not, however, forget the more staid and relatively conventional 
methods of mathematical optimization, which I call deductive inference. 
Hence I also briefly sketch some of these methods and describe their appli- 
cation to computer-aided medical diagnosis. The chapter begins with a 
discussion of automatic-programming language translation, since the 
methods of syntactical analysis introduced there are used later on. The 
chapter ends with what is labeled as a consideration of programming to 
achieve abstraction and creativity, but which is, of course, simply a brief 
reporting of some ideas which have appeared in the literature and which 
profess to pertain to this topic. Because of the great wealth of material 
available in the literature, I have found it necessary to limit the discussion 
io only a few of the research approaches I considered “typical” or illus- 
trative of some basic considerations—even so, because of the limited 


/ P. Naur et al., Report on the Algorithmic Language ALGOL 60, Communs. ACM, 
vol, 3, no. 5, pp. 209-314, May, 1960. 
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space that could properly be devoted to topics of this kind in an intro- 
ductory text, it was necessary to be very sketchy. The entire chapter is 
calculated to raise more questions than it answers, but if it stimulates the 
student to think further about these advanced topics, it will have served 
its purpose. Certainly a text of this type would be incomplete without 
such a chapter! 

Part 3 of the book is primarily included as a practice source in computer 
utilization techniques. The first chapter in this part, Chapter 9, con- 
siders elementary aspects of numerical analysis, stressing the concept of 
the polynomial-approximation approach for function evaluation and 
briefly touching upon ordinary and partial differential equations. The 
chapter concludes with a discussion of accuracy and error in numerical 
calculations. Chapter 10 considers computational methods in and 
applications of Boolean algebra. I believe that the role of Boolean 
algebra as an important tool and practical aid to the computational 
analysis of a large variety of problems (in fields other than switching- 
circuit theory and the logical design of digital computers) is not so well 
recognized as-it might be. In Chapter 10, first Boolean algebra is intro- 
duced in terms of the propositional-calculus representations and the 
class, or set, representations. The method of using bases and associated 
designation numbers (which was first fully developed by the author, 
although it appeared implicitly in some earlier writings) has been found 
wdmirably suited to the teaching of the more complicated methods in 
Hoolean algebra, such as the several modern procedures for the simplifica- 
lion of Boolean functions, the introduction of constraints into a problem, 
and the handling of logical dependence and independence. The final 
three sections of this chapter present methods which with a few excep- 
tions are based on the author’s original research. Here Boolean matrices 
are introduced and utilized for obtaining systematic computational 
methods for the solution to Boolean equations of a variety of types. 
Illustrations are then given of the application of these methods. The 
final chapter of Part 3, Chapter 11, is concerned with processing methods 
other than those used in numerical or logical analysis, which are, never- 
theless, of fundamental importance in business and other activities 
requiring large-scale information handling and data reduction. In addi- 
tion to techniques for searching and sorting, examples of methods for 
redundant (error-correcting) and irredundant (overlapping) coding are 
also considered. 

Possible Course Structures. This text can be used in courses at the 
sophomore, junior, or senior college level or at the first-year graduate 
level, in several different ways—as a one-, two-, or three-semester course. 
Although undoubtedly each instructor will have his own personal prefer- 
ences, Table 1 suggests the chapters and/or sections that could be covered 


Mophomore...| Chaps. 1, 2; Chaps. 1 to | Chaps. 5 to| Chaps. 1 to 
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in such courses. I believe that even in an introductory (one-term) 
sophomore course, a good deal of automatic programming should be 
included, so that the student may view the concepts of the modern pro- 
yramming field with better perspective. It may be wise also to introduce 
(he junior or senior to other than general-purpose programming, through 
1 consideration of the digital differential analyzer. In the more leisurely 


Tas_e 1. SucaEsTED CouRSE OUTLINE 
$< 


Two-term course Three-term course 





One-term 
course 


Year 
Term 1 Term 2 Term 1 Term 2 Term 3 





Chaps. 5 to | Secs. 4-4 to 
Secs. 3-1 to | 3;Secs.4-2,| 7; Secs. 8-1 | 3; Secs. 4-2,| 7; Secs. 8-1 4-6; 
3-3, 3-7; 4-3 to 8-3 4-3 to 8-3 Secs. 8-4 to 


8-8; 
Chap. 6; : 
Secs. 8-1 to Chaps. 9, 11 


8-3 
Junior.......| Chaps. 1, 2;| Part 1¢ Part 2¢ Part 1¢ Part 2{ Part 3§ 
Secs. 3-1 to 
3-3, 3-7; 
Sec. 4-2; 
Chaps. 5, 6; 
Secs. 8-1 to 
8-3 
Monior...... Chaps. 1 to 3; | Part 1f Part 2{ Part 1¢ Part 2t Part 3§ 
Sec. 4-2; 
Chaps. 5 to 
7; Secs. 8-1 
to 8-3 


Virst year Chap. 1 (for | Part 1; Chap. 10; 
uraduate home study) ;|_ Chaps. 9, 11] Part 2 
Chaps. 2 to 
4; Chap. 5 
(for home 

study); 
Chaps. 6 to 8 


a 
| Part 1 includes Chaps. 1 through 4. 

{| Part 2 includes Chaps. 5 through 8. 

§ Part 3 includes Chaps. 9 through 11. 




















two- and three-term courses, further attention can be devoted to special- 
purpose computers. In the three-term courses, the third ten might be 
given concurrently with the second term as a course entitled Introdue- 
tion to the Utilization of Digital Computers.” An alternative to the 
suggestions given in Table 1 is a one-term course entitled “Introduction to 
Automatic Programming,’ where Chapter 1 could be covered (if the 
students have had no other association with digital computers) and then 
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immediately Chapters 6, 7, and 8 could be covered. The advantage of 
such a course is that in a single term the student would gain knowledge 
enabling him to grasp thoroughly automatic-programming languages 
used on commercially available computers, without having to wade 
through the details of machine-language programming; on the other hand 
the disadvantage is obvious, that the student would be required to aes 
on faith alone the fact that such automatic-programming languages can 
be realized in terms of machine-language instructions. 

Special Technical Features. Parts 1 and 3 of this book are basically 
excerpts of material in the author’s engineering text.{| Though this 
previous text included much material of value to programmers, the pub- 
lisher felt that programmers would not be particularly interested in 
detailed methods for the logical and electronic-engineering design of a 
computer. These parts of the engineering text were eliminated to make 
room for a greatly expanded treatment of the subject of primary concern 
to the modern computer programmer—automatic programming. Chap- 
ters | to 4 of the present book correspond, with some rearrangements and 
deletions of material, to Chapters 1 to 4 and 8 of the engineering book but 
include a new discussion of modern memory-access methods. To Chap- 


Aes op ' , ; 
lor b of the engineering book I have added a section on data-handling 
routines, including a discussion of threaded and push-down lists; also the 
dincuimaion of algebraic compiling routines has been largely rewritten 


The present chapter on numerical analysis includes a new discussion of 
“differences” and partial differential equations. Chapter 10 of this text 
In greatly condensed version, for programmers, of the material contained 
in Chapters 10 through 14 of the engineering text, with the primarily 
engineering applications eliminated. Chapters 6 to 8 are entirely new 
presen ting material which to my knowledge appears in book form for the 
first time. 

Special features of Part 1 include the use of a four-address system for 
introducing the basic ingredients of an instruction, and the discussion in 
Section 3-5 of the relative, indirect, augmented, and compound memory- 
addressing techniques. The specific method of Section 4-3 for coding the 
control computer was developed by the author, and the method for coding 
the logistics and business computer, appearing in Section 4-5, is based on 
the work of W. H. Marlow. 

In Part 2, I present an introduction to the two automatic-programming 
languages that are becoming internationally accepted: ALGOL and 
COBOL. Sponsored by the Association for Computing Machinery and 
several foreign societies, the universal acceptance of ALGOL would 
undoubtedly have a great beneficial effect on the national and interna- 


R. Ss. Ledley Digital Computer and Cont: 0 4 neerir Me ( iraw- Hill Book 
’ r I n 
Company, Ine New York 1960. | ginee 1 1g, 
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tional exchange of ideas and methods of programming. The discussion of 
ALGOL also contains the primary features of other automatic programs, 
such as FORTRAN, GOTRAN, FOR TRANSIT, Honeywell Algebraic 
Compiler, Phileo 2000 ALPAC, and so forth. 

‘That the general acceptance of COBOL could produce extensive savings 
in the accounting and legal applications of computers has received wide 
recognition in business and industrial circles. In fact, some agencies of 
the United States government are requiring the preparation of a COBOL 
language by commercial computer manufacturers for computers to be 
used for such purposes. 

Also included in Part 2 are several special features: the discussion of 
algebraic compilers (Section 5-8), based on the work of R. W. Floyd; the 
description of automatic-programming translation (Sections 8-2, 8-3), 
hased on an idea of E. T. Irons and given in a formulation by the author 
and J. B. Wilson; and the treatment of computer aids to medical diag- 
nosis (Section 8-5), based on the work of the author and L. B. Lusted. 
The discussion of heuristic programming and the general problem solver 
(Sections 8-6, 8-7) is based on articles by H. L. Gelernter and N. Rochester 
and by B. A. Newell, J. C. Shaw, and H. A. Simon. The utilization of 
ihe automatic-programming routine and the general problem-solver 
routine for concept recognition (Section 8-8) is the result of the author’s 
original investigations, and the discussion of creativity is based on parts 
of a book by L. A. Hiller and L. M. Isaacson. 

Special features of Part 3 include J. H. Wegstein’s general method for 
accelerating the convergence of iterative solutions to equations (Section 
2), the author’s Tabledex method, and his techniques for searching 
with relaxed conditions (Sections 11-2, 11-4). A new, simpler version, 
due to W. R. Smith, of the exact formulas for ‘Srredundant”’ (overlap- 
ping) coding has here (Section 11-7) replaced the more cumbersome 
formulas given in the engineering text. The computational methods for 
eonstraints in a Boolean algebra, logical dependence and independence, 
the solution to Boolean equations, the method of antecedence and con- 
wquence solutions, and the fundamental Boolean matrix formulas given 
i Chapter 10 are the author’s original developments. W. R. Smith 
developed the general solution to the logical matrix equation (Section 
0-14). J. B. Wilson contributed the decimal simplification method 
(Mection 10-7), and R. D. Elbourn deserves credit for the direct. method 
for finding prime implicants (Section 10-8). 
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PART 1 


MACHINE LANGUAGES 


CHAPTER 1 


APPLICATIONS AND DESCRIPTION 
OF A DIGITAL COMPUTER 


1-1, Introduction 


Another Great Evolutionary Step in Mankind’s Capabilities. The intro- 
(uction of agriculture revolutionized ancient man’s social, economic, and 
Wiultural potential. This was the first great step in the evolution of civili- 
yilion, The more recent Industrial Revolution, vastly increasing man’s 
productive capabilities, was the next great step and brought forth our 
present highly mechanized economic and interdependent social civili- 
yilion. We are now on the threshold of still another new kind of revo- 
liilion, based on machines that greatly increase man’s “thinking” capa- 
hiilities of planning, analyzing, computing, and controlling. Even though 
(ho digital-computer field has existed for scarcely a decade, many thou- 
wiids of computers are already in daily use, penetrating almost all phases 
af our modern society, from nuclear-energy production and missile design 
\0 (he processing of bank checks, business invoices, and medical diagnoses. 
Thin new and unique potential of mankind will have a marked influence 
ol only on the physical sciences and technology but, perhaps even more 
simnificantly, on the biological sciences and the economic, political, and 
sell aspects of our civilization. The changes that will evolve from this 
jwweat of mankind’s potentialities are difficult to foresee, but they can be 
even greater than the spectacular changes occurring in our civilization 
from 1860 to 1960—from the beginnings of the Industrial Revolution to 
fodern times, 

The concept of an evolutionary development of civilization as a con- 
uation of biological evolution has been emphasized by the noted biolo- 
“ist Julian Huxley and can be clearly seen in the great dependence and 
ellance of modern industrial and technological capabilities upon the 
heritage of past generations, The development of computer capabilities 
slvongly displays such evolutionary characteristics. The use of com- 
jiilors to design more powerful computers has already begun, Hardly a 
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calculation is presently planned that does not utilize some previously ; . 

written computer program. And so it is also with the evolution of con- <j Pe S SN 

cepts and ideas for utilizing computers in more and more complex ways, N2ES Xe eS Qu 

to handle such complicated control problems as factory, aircraft, or e 63" < \z g 
re 


defense-systems control, to prove mathematical theorems, to simulate 
complicated many-variable situations, such as military tactics and 
strategy or economic planning and development, ete. To quote Julian 
Huxley :t 
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If we hope to sum up the situation epigrammatically, we can say that, with the => 
advent of man, conscious purpose was able for the first time to exert an operative Pa 
effect on evolution; and that through the new knowledge acquired during the last — 
hundred years, the evolutionary process was able for the first time to become 
conscious of itself. The next decisive step in evolution will accordingly be the 
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We cannot claim that this tree is complete, but we hope that the major trends 





fuller development of that self-consciousness. In particular it will be the con- a7 3 a 
scious clarification of the future in the light of the long evolutionary past and of —s <s he s 
the study of human potentialities. Once this has been done, greater realization - = a3 
of possibilities rather than higher productivity, fulfillment rather than efficiency, = —° oe Q 8 
will become the overriding aim. = ees 2 a 
——— = 4 % 3 w 

Historical Notes. The first digital electronic computer with a high- —s ee g es 

speed electronic memory was successfully operated a little more than ee eA = as? 
— 7 = o 

ten years ago. However, the concept of a computer dates back many —K —x —|5 = Be 
centuries. John Napier’s celebrated ‘bones,’ described in 1617, assisted + ae a << Ts 
in multiplication. In 1642 (the very year of Newton’s birth), Pascal —i— tw + 38 
invented a mechanical computer that utilized a carry for adding; and in C2 x 3 a” 
1671 Leibnitz described a machine that could multiply by successive < —o = a ‘ 
additions. The Jacquard loom (1801), which was evidently an out- <= 5 S 5 S 
growth of several inventors’ contributions, is memorable in that it uti- 5 A 3 84 B, 
lized punched cards to control various weaving patterns. But Charles = i™.-3 Os ES \\ S 
Babbage is probably the one pioneer most deserving of credit for develop- pitt 7: /2 3 
ing many of the most important basic ideas utilized in modern computer a “eo 8/, ad a7 ) =) 
concepts. Between 1834 and 1854 he worked on his “analytic engine,” ae fav JS = 
which utilized gears for the calculations; the machine was, however, never 3—s J = 2 
completed. These are some of the ancient historical roots of the com- —f]-7 ce = 
puter field, which led eventually to the conventional desk calculating EA ‘6 ) 3 
machine and the common punch-card machine (see Fig. 1-1). & 37) << s 

It was not until the early 1940s that there arrived the true precursors Zi ey) 4 
of the modern computer. A series of relay computers were made at the re) a rf 
Bell Telephone Laboratories and at Harvard University. The ENIAC, Ps Q ony A: 
built at the University of Pennsylvania’s Moore School of Electrical aS Ss 38 
Engineering from 1942 to 1945, was the first computer to utilize elec- 4 —— “5 

} Julian Huxley, Evolution and Genetics, in James R, Newman (ed,), ‘‘What Is a Fs E 


Science?’’ p. 289, Simon and Schuster, Ine,, New York, 1955, 











Fig. 1-2. Present-day computers. (A) IBM RAMAC 1401 data-processing system}; 
(B) Honeywell 800; (C) Datamatic 1000; (D) UNIVACG solid-state computer; (2) 


UNIVAC 1105; (F) RCA 501; (@) air-borne computer; (/7) RVS-Mark I (an auto- 
matic order-filling system), [Courtesy (A) International Business Machines Corp.; 
(B, C) Minneapolis-Honeywell; (D, E) Remington Rand; (1) Radio Corporation of 
America; (G) Libraacope, Tne,; (11) Induatrial Electronica Engineers, Tne.) 
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hin, 1-2. Present-day computers (continued). (I) IBM 709; (J) I : 

? : y ie p ed). 9; ( BM 705; (kK 
HM (50; ( L)I orseus G/7578/1; (M) Burroughs 205; (NV) NCR 304; (0) aca 
" 4000; CU ) Bendix G-15. [Courtesy (1, J, K) International Business Machines Corp.; 
{ y ) Merranti, Lid.; (M) Burroughs Corp.; (N) National Cash Register Co.; (0) Philco 
Corp.) (2) Bendix Computer Division, | 
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tronic tubes. But these computers did not yet embody all the basic 
principles of modern computers. 

The precursor stage was followed by the developmental stage, which 
lasted from 1946 to about 1952. An important event in computer his- 
tory was a course given at the University of Pennsylvania Moore School 
in July and August of 1946. Here were described the basic concepts of 
the EDVAC, the design of which was being initiated at the Moore School. 
The EDVAC encompassed the major design features of modern com- 
puters. The design of the SEAC at the National Bureau of Standards 
and the EDSAC at Cambridge University, both of which computers 
were successfully operated even before the EDVAC, can be traced 
directly to that course. The IAS computer built at the Institute for 
Advanced Study can also be traced to the Moore School, as can proba- 
bly many of the features of the Whirlwind computer built at MIT. 
But the SEAC, built from 1948 to 1950, was the first modern electroni¢ 
computer in this country. Other pioneering computer efforts were suc- 
cessfully completed a few years later, and the fourth, or commercial, 
stage of computer history was initiated (see Figs. 1-1 and 1-2). 

The names of 8. Williams and G. Stibitz of the Bell Laboratories; 
H. Aikin of Harvard University; J. Eckert and J. Mauchly of th 
Moore School, University of Pennsylvania; M. V. Wilkes of Cambridg' 
University, England; J. Curtiss, S. N. Alexander, R. Slutz, and 8. Lubki 
of the National Bureau of Standards; J. von Neumann, H. Goldstine, an 
J. Bigilo of the Institute for Advanced Study; J. Forrester of the Mass 
chusetts Institute of Technology; N. Rochester and J. A. Haddad of th 
International Business Machines Corp., among many others, are out 
standing in the computer field. In a sense it is unfair to mention onl 
these names, for there are so many who have contributed their geniu 
to the development of modern computers. In fact, it is probably one o 
the unresolved tragedies of modern technology that the results of indi 
vidual genius are not long remembered as associated with that individu 
but are left to blend unidentified and unrecognized in the forgotte 
chronicles of these rapidly changing times. To do true justice to thes 
brilliant scientists would require extensive research in itself. 

The Role of the Programmer. In general, a computer can be conceive 
as a numerical-transformation machine. Numbers are the inputs to it 
and the computer transforms these numbers into new numbers, whic 
appear as the outputs (see Fig. 1-3). For instance, the input numbe 
may be the initial conditions of a differential equation, and the outpu 
numbers will be a table of the functional solutions; or the input number 
may be raw experimental data, and the output numbers will be th 
parameters determined by the experiment; and so forth, However, the 
computer must be directed to perform the required computations, and 
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Data ————————»__ input numbers 
Output 


Computer 
. numbers. 


Programmer ————— instruction list 
Fig. 1-3. Data flow to computer. 


hence, in addition to the input numerical data, there must also be input 
instructions, which are coded in the form of numbers. Such a coded list 
of instructions is called a program, or code; the preparation of this pro- 
rum or code is the responsibility of the computer programmer. The 
purpose of this book is to describe the concepts, methods, and tech- 
iiiques involved in the preparation of a program, or instruction list, for a 
high-speed electronic digital computer. The role of the computer pro- 
urammer is to utilize these methods and techniques to translate the 
yerbal, mathematical, or other description of a computational problem 
lilo a list of instructions that will direct the computer to perform suc- 
wounfully the required computations. 

A word should be said about the kinds of electronic computers. There 
ive two main kinds, called analog and digital. In digital computers the 
jiimbers (digits) themselves are handled explicitly by the computer; an 
(wulog computer deals instead with a representation of the numbers, for 
liimlunce, voltage, length, etc. In this book we are concerned with pro- 
“rumming for digital computers only. These can be classified as general- 
purpose and special-purpose computers. Most of this book, however, 
with the exception of Chap. 4, is concerned with general-purpose com- 
jiilors, because a knowledge of these computers is a prerequisite for an 
derstanding of the special-purpose computers. In Chap. 4 we shall 
tonsider some of the problems that arise in programming for special- 
jiirpose computers. 

Chapter Outline. In the present chapter, we shall first consider appli- 
talions of computers and then proceed with the principles of digital- 
‘omputer programming. We hope that by presenting brief but rela- 
lively detailed descriptions of a few specific applications we may impart 
(0 the reader a better insight into how digital computers are used. 

lor the purposes of this chapter we have arbitrarily classified computer 
Applications into four categories; many applications can fall into more 
(hun one of our categories. First, we shall consider numerical solutions 
{0 equations. This represents an important class of applications, and 
idoubtedly it is these applications which makers of the first computers 
liu in mind, The second category is process control: here the need for 
igital computers with specialized capabilities becomes apparent. It is 
probably not an exaggeration to say that the third category actually came 
into being because of the great capabilities of the computers: this is 
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simulation. Before the advent of computers, detailed simulations were 
rarely even discussed because they were clearly not feasible to perform. 
However, with the use of high-speed computers, simulations have become 
practical, opening new fields of scientific research. Many of the present- 
day computer applications can probably be classed in the fourth category, 
data processing: Business accounting, statistical reductions, information 
retrieval, and many other important but routine procedures can be 
handled by computers. 
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Fic. 1-4. Announcement of Argus shots. 


1-2. Numerical Solution to Equations 


Motion of Charged Particles in the Earth’s Magnetic Field. Project 
Argus was a major scientific and military experiment conducted by the 
United States (Fig. 1-4). Rockets carried a small atomic bomb more 
than 300 miles above the earth, at this altitude the bomb was exploded, 
and a resulting thin layer of radiation spread quickly round the globe. 
Nuclear and atomic particles were propelled by the exploding bomb and 
streaked through the vacuum of the universe under forces almost entirely 
due to the earth’s magnetic field, Some particles moved toward the 
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ourth’s poles and caused artificial auroras (northern and southern lights). 
Ii, order to measure the characteristics of this radiation layer, an earth 
atellite and other rockets were launched with paths passing through the 
layer (see Fig. 1-5). Instruments in the satellite and rockets transmitted 
ensurements back to earth. Scientifically the shots rank among man- 
kind’s foremost experiments. They showed that radiation introduced 





liq. 1-5. Earth satellite measuring characteristics of radiation layer. 


into space by man can have significant effects over those of natural 
radiation, Militarily the Argus shots indicated that the additional radi- 
ation added to the natural radiation interfered with man’s electronic 
fPommunications. 

he Argus project tested theoretical calculations of what would take 
place when the small atomic bomb was exploded above the earth’s atmos- 
phere, This was a problem in the study of the motion of charged parti- 
clon in a magnetic field (the earth’s) and is in principle, except for the 
shape of the field, the same as the problems encountered in magnetically 
directing an electron beam in a television tube, or accelerating particles 
jy a betatron, or circulating particles in a cyclotron, ete. If the earth’s 
Wugnotic field is represented as that of a magnetic dipole, and if x, y, and @ 





10 MACHINE LANGUAGES 


{[Cuap. 1 


are cartesian coordinates with the z axis directed along the earth’s mag- 
netic axis from south to north, then from electromagnetic field theor 
it can be shown that the trajectories of the charged particles are given by 
the solutions to the following differential equations (where the super- 
script dots denote differentiation with respect to time): 


_ Ma 


= i? + oy? + 2?) (Qe? — a? — y*)y — Byezd]} 
g = MO f(t yt + Ce + 7? — 229K + Baza]| 
2 = ET Laake $ yt +e + + Ve ty $24] 


where k = (xy — y%) — (a? + y?)(a? + y? + 2”)-*% 

earth’s magnetic dipole strength 

relativistic mass of particle 

= charge of particle 

= speed of light 

The initial conditions are the initial coordinates (ao,y0,20) of the particl 
and the components (40,%0,20) of its velocity just after the explosion. 

These equations can be used to plot the paths of the charged particle 
given their initial velocities and directions. The distribution of th 
initial velocities and directions can be calculated from nuclear physic 
the trajectories of different kinds of particles can then be compute 
for many different initial conditions in the distribution. The complexit 
of these computations requires the use of a digital computer. The basi 
importance of such a calculation preliminary to the actual experiment i 
evident: by predicting the motion of the particles, proper and adequa 
plans can be made concerning the instrumentations of the satellite an 
the measurements to be made, the best geographical location of the expe 
ment, the optimum altitude for exploding the bomb, and so forth. 

Solutions to Equations in Biomedical Research. The partial differenti 
equations that describe biological phenomena more often than not def 
analytical solution. Attempts are usually made either to introduce si 
plifying assumptions or to isolate special aspects of the problem for inve 
tigation. Both these alternatives involve certain compromises on t 
part of a researcher. Such compromises can often be avoided, however 
by using an electronic computer to solve the original partial differenti 
equations numerically. 

Computers are being used to assist in the complicated and extensivi 
computations that are frequently involved in obtaining information abou 
the precise atom structure or the over-all size and shape of crystallizabl 
molecules, from X-ray diffraction patterns. Various organic molecul 
have been and are being analyzed by this method, including the well 
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known cases of vitamin B,2, myoglobin, hemoglobin, lysozyme, and the 
hydroxyapatite of teeth and bone. These methods are also being applied 
in analyzing the structures of amino acids and peptides and the over-all 
shape of nucleic acids and other large molecules. For nonerystallizable 
substances, information about their structures can be obtained from 
scattered-light interference patterns, osmotic-pressure and viscosity meas- 
urements, and sedimentation rates. Here the computer could again sig- 
nificantly assist in the computations involved. 
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(c) (b) 
fiw, 1-6. Example of radioactive tracer study. (a) The five “compartments” of a 
\hyroid tracer study; (b) typical measured levels of radioactive iodine in the com- 


juriments; (¢) the set of five linear differential equations giving the dependence of the 
ale of change of amount q; of radioactive iodine in compartment 7, where dj; 1s the 
fractional turnover rate of radioactive material from compartment 7 to compartment 7. 


Another relatively active field in the use of computers is the study of 
(iferential equations relating to the mechanisms of biological systems. 
Tho famous Hodgkins-Huxley equations, and several variations thereof, 
juve been analyzed with respect to the properties of excitable membranes. 
Hiwsides the computations, well under way, concerning the nature of 
orve-fiber conduction, other studies will apply these equations to muscle- 


(ber excitability. Other equations, relating to semipermeable mem- 
vanes and the diffusion and motility of large molecules, could be solved 
uumerically on the computer for complicated geometries and other speci- 
fieations. This capability presents a further tool in the study of the 


wature of the physical phenomena involved in intracellular dynamics. 
Complicated sets of interrelated differential rate equations can in 
juny cases be solved on the computer, Such computations have been 
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applied to multiple-reaction-rate problems involved in the exploration of 
certain metabolic processes. Computations involving such sets of equa- 
tions have been used to predict intermediate reactions from a knowledge 
of experimentally determined final reaction rates. Another important 
rate-equation problem to which computers have been applied is that of 
tracing through the various organs and systems of the body the time 
course of injected radioactive substances, for diagnosis of metabolic dis- 
turbances. For example, if radioactive iodine is used for a test of hyper- 
thyroidism, the amount of radioactive substance would be measured as 
a function of the time in such “compartments” as the blood iodide, the 
protein-bound iodine, the thyroid gland, the feces, and the excreted urine. 
From these data the appropriate fractional-turnover rate equations are 
solved for the turnover-rate constants between compartments, these con- 
stants being the desired indices of disease (see Fig. 1-6). Facets of this 
problem are still being worked out, such as uniqueness of results, con- 
sistency of redundant information, statistical variation, and so forth. 
Radioactive-carbon-tagged chemicals can be used in this manner for pin- 
pointing specific dysfunctions of the metabolic pathway, opening up new 
fields of research in computer-aided specific diagnostic analysis. 


1-3. Process Control 


Machine-tool Control. Significant improvements in many manufac- 
turing processes can be obtained with the application of digital data- 
processing and control techniques. An example is digital machine-tool 
control, used to improve the over-all process of producing many different 
and complicated parts needed in our modern technology. Greater accu- 
racy, reproducibility, and versatility are obtained as tooling costs and 
skilled-manpower requirements are reduced. It increases our capabili- 
ties to produce greater quantities and varieties of precision-machined 
parts, thus revolutionizing their potential use. 

Consider a milling machine cutting a two-dimensional contour. The 
cutting tool is moved in the x and y directions by means of high-perform- 
ance hydraulic servo drives. These servo drives are controlled by 
rapidly sequenced pulses that tell the servos precisely where to move the 
tool in order to obtain the contour finally desired. For instance, each 
time the x-direction control receives a pulse, it might advance the tool 
0.0002 in. in the x direction; if we assume that there can be up to 20,000 
such pulses per second, the tool can be made to advance in the x direc- 
tion a maximum of 0.0002 X 20,000 = 4 ips = 20fpm. By varying the 
number of pulses per second, up to the maximum, the speed with which 
the tool moves in the x direction can be precisely controlled, The con- 
trol of the y-direction motion of the tool is similar. ‘Thus the tool can be 
directed to move round any desired contour, ‘The precision of machining 
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the desired part is of course limited by the smallest increment of distance 
of the control. 

‘The problem of digital machine-tool control reduces to that of trans- 
forming information on a blueprint design into an appropriately timed 
sequence of 2 and y input control pulses. The control pulses direct the 
vonter of the cutting tool, but the desired contour is formed by the edge 
of the cutter; hence the proper cutter offset must be calculated from the 
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Fra. 1-7. Cutter offset path. 


hlueprint. See, for example, Fig. 1-7, where the circles indicate some of 
(he successive positions of the cutter. If we assume in Fig. 1-7 that the 
“raph lines are 0.0002 in. apart, then the dots represent the successive 
positions of the offset path of the cutter’s center that should be directed 
hy the control pulses. (For example, the diagonal cut would be made by 
(wo pulses in the y direction for each one pulse in the a direction.) Of 
eourse, in a realistie case the part would be many times the size illus- 
inated, and hence the number of input pulses required would be very 
larwe. Thus it would be a difficult job to start from the blueprint and 
determine manually the detailed sequences of control pulses. The 
jwethod of interpreting the blueprint design for the automatically con- 


-ipolled milling machine is divided into two steps. First the cutter’s offset 


jth ix determined grossly in terms of the end points of straight-line path 
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segments, radii of circular contours, and so forth. This in itself can 
present difficult problems of curve fitting. The second step is to inter- 
polate between these points and from the radii, etc., to determine finally 
the precise 0.0002-in. input-pulse fine structure. 

Two computers are used for this process. The first is a general-purpose 
digital computer into which are put the specifications for the part contour 
taken from the blueprint. These might take the forms “diagonal at 
arctan 2,” “straight 4 inches,” “horizontal 3 inches,” “‘radius 2 inches,” 
etc. Or the input to the digital computer might be the blueprint itself, 
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computer 
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Fig. 1-8. Machine-tool control process. 


put into the computer by means of a “picture-reading”’ machine. The 
computer is programmed to compute from such input data the gross 
data of the offset cutter path. This is perhaps put out in the form of 
punched paper tape, like the rolls of a player piano, or in the form of a 
deck of punched cards. The second computer is used to control the 
milling machine directly. The punched paper tape or card deck is put 
into this control computer, which interpolates, etc., to produce the fine- 
structure input pulses to the servo drives. The entire process is illus- 
trated in Fig. 1-8. 

Control of Chemical Plant. In many factories the processing of raw 
materials frequently involves sequences of decisions for which all alterna- 
tive possibilities are known. By means of digital computers the entire 
processing can be made completely automatic. Control of this kind 
has the advantages that the decision processes themselves become more 
reliable when performed electronically and that the design of more com- 
plicated processes becomes feasible, As a specific example, consider a 
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simplified version of a chemical factory that produces Al,O; (see Fig. 
\-)). Table 4-2} summarizes the conditions which initiate various 
actions in the plant. The column entitled “Sensed-signal conditions” 
represents the list of conditions requiring some operation to be performed. 
he column entitled “Corresponding signals to be generated”’ indicates 
ihe actions to be taken. 

Input signals to the control computer are transmitted by wires from 
vritieal places in the factory (see Fig. 1-9). These signals, defined in 
‘able 4-1, consist essentially of two-valued (on or off) test results which 
jy combination tell the computer the present state of the factory. Yor 
oxumple, signal L(1) will be a pulse when the mixture in mixing tank 1 is 
al or above a predetermined level and no pulse if the mixture is below 
this level. Each signal is initiated by a transducer, a device that con- 
verts mechanical information into electrical information or vice versa. 
hoe transducers for our example convert fluid levels in tanks, tempera- 
j\ires, chemical concentrations, volumes of precipitate in a centrifuge, 
pressures, and so forth, into the electrical pulses that are the inputs to 
ihe computer. The outputs from the control computer are electrical 
signals supplied to transducers (servomechanisms) that open and close 
valves or hatches and otherwise operate to change the state of the fac- 
tory. Table 4-2 shows under what conditions such changes of state 
will be initiated. 


1-4, Simulations 


Industrial Dynamics. The term industrial dynamics refers to the 
processes and factors influencing the economic development of an indus- 
ivial corporation considered as an interacting part of a free competitive 
eeonomy. The study of industrial dynamics involves so many factors 
ihat no complete analytical mathematical analysis is yet possible. 
Many universities in the United States, in cooperation with many 
jndustrial corporations, are attempting to simulate aspects of the indus- 
iriul-dynamics problem. Representatives of management take part in 
sich simulations, making decisions; the digital computer then computes 
iho effects of these decisions and produces a report to the management 
representatives, who make new decisions, etc. (see Fig. 1-10). One 
advantage of such simulations is that they aid in training management 
personnel by presenting them with many and different industrial circum- 
stances analogous to real situations. A second advantage is that the 
simulations aid the study of industrial dynamics by bringing before the 
sclentists involved situations not encountered through purely academic 

{The control of this particular process will be considered in greater detail in Sec. 


14) it- suffices here to outline briefly the kind of input information supplied to the 
somputer and the way the computer then controls the factory, (See pp. 132 to 137.) 
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Fig. 1-9. Control of chemical plant. 
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aludy. And, third, important generalizations can be deduced from the 
progress and results of the simulations. 

As an illustration, consider a multifirm one-product competitive indus- 
iry. The simulation is designed in such a way that the results of 5 years 
of operations can be simulated in 1 day. Each firm in the simulation is 
represented by a team of executives, who must make the following set of 
‘inanagement”’ decisions: 

|. Price of product 

2. Producing volume 

3. Advertising expenditures 

|. Research and development expenditures 

5, Amount of new investment in plant and equipment 
Dividends to be paid, etc. 

One set of such decisions is made by each team firm for each quarter 
of w year. The computer then computes the interactions and effects 
of these decisions on the market; i.e., it simulates what may happen in 
roulity and produces for each team a report giving the following: 

|, Sales volume 

2. Per cent of total products sales handled by this firm 

4. Production capacity of this firm for the next quarter 

|. Statement of profit and loss 

5, Current inventory quantities 

(, Statement of receipts and disbursements 

7, Statement of financial conditions, ete. 

On the basis of study of its report, each team makes decisions on how 
the firm will operate in the next quarter. These decisions are fed into 
(he computer, and the simulations progress in this way from quarter to 
quarter, 

‘he computer processes the decisions made by the teams (firms) and 
produces reports by means of mathematical models. The model is 
mmontially a set of hypothetical equations relating the input information 
(0 the desired output information. For example, increased advertising 
hours a functional relation to increased sales, but this in turn depends 
un the advertising decisions of competitive firms. Increased research 
and development can produce an improved product which may give 
one firm some advantage. Price rises of course offset sales, but adver- 
lining can to a certain extent offset an increase in price. Obviously the 
rows profit is the product of sales volume and price per unit product, 
iiinus administrative costs, overhead, manufacturing costs, advertising, 
fowoarch and development, ete. The mathematical model embodying 
sich considerations can become very complicated, depending on the level 
it which the simulation is intended. Evidently a great deal of study 


and research must be carried out in connection with the derivation of 
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the specific equations that enter into the computer simulation. For a 
simulation to be at all realistic, the complications will be so numerous 
as to be impossible to perform without digital computers. 

Man-Machine Simulation System. Man-machine simulation is now 
recognized as an essential tool for the development of complex automatic 
systems that involve both electronic and manual controls. Such is the 
case, for example, in air-traffic control, ground control of interceptors, 
ground-controlled approach and landing, weapon assignment, or missile 
guidance systems. The interactions between the electronic control sys- 
tem being designed and the team of human operators can be studied 
before the system is actually constructed. This eliminates the only other 
alternative, the frequently prohibitively expensive process of actually 
building the proposed system and carrying out an extensive program of 
tests and modifications. 

Figure 1-11 represents a simulation system that includes both a digital 
and an analog computer, linked with display and control equipment 
which permit human beings to be coupled into the system. The analog- 
computer equipment is used for such tasks as accepting continuous con- 
trol information and solving complex dynamic equations in real time, 
for which they are suited. The digital computer is used for controlling 
the experiment, for sequencing the stages depending on the outcome 
of each stage, for generating simulated data, for calculating with high 
precision where necessary, for handling variables with large dynamic 
ranges, for making logical decisions, and for statistically analyzing 
experimental results. 

An example of a simulation is a simplified ground-controlled inter- 
ceptor system (Fig. 1-11). The digital computer simulates a situation 
in which a large number of aircraft are observed by the search radar. A 
console displays the locations of these aircraft on a cathode-ray tube 
for the ground crew; with each aircraft is also displayed an associated 
identification (i.e., friendly, enemy, unknown, interceptor, bomber, com- 
mercial, etc.). The ground crew manually assigns targets to the inter- 
ceptors; the targets are automatically tracked by the digital computer. 
A model of an interceptor cockpit fully equipped with instruments and 
pilot flying controls enables a human operator to imitate the actions 
of the interceptor pilot in flight. The pilot of the interceptor receives 
his flight-control information either directly from the digital computer 
or from the ground-crew commands. The pilot’s control movements are 
interpreted by the analog computer, which then simulates the dynamics 
of the interceptor, operating the instruments in the cockpit. The veloe- 
ity components of the interceptor are sent to the digital computer, 
which computes the path of the interceptor. The digital computer also 
displays the computed positions of both interceptor and target on a 
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monitor scope for the ground crew. It is this display which gives the 
ground crew its information for the simulated data link with the pilot. 
The research control console enables a research team to set up the initial 
conditions, vary the target aircraft maneuvers, and determine the inter- 
ceptor closure tactics. Records are kept of information generated by the 
interceptor pilot, analog computer, and digital computer during the 
experiment. These can then be analyzed on the digital computer after 
the experimental simulation has been completed. 


1-5. Data Processing 


Inventory Control. The annual industrial inventory of the United 
States has been estimated at 30 billion dollars; the cost of supporting this 
inventory is probably about 3 billion dollars annually in interest lost, 
tied-up money, depreciation, obsolescence, spoilage, storage, handling 
charges, and so forth. It has been estimated that the use of computer 
control and information-processing methods could reduce this inventory 
and its associated cost of support by more than 25 per cent. Figure 1-12 
indicates how a computer could be used in an inventory-control system. 
Here the inventory files are electronically recorded in the computer. 
Inputs to the computer are items initiated outside the inventory records, 
such as receipts, issues, orders, shipping reports, etc. These inputs indi- 
cate the alterations in the inventory records that must be made by the 
computer. The outputs of the computer are items that arise from an 
examination of the inventory records, such as the need for more parts, 
orders to restock, bills or invoices for items shipped, accounting records, 
and other data about the current inventory records that may aid manage- 
ment in planning, etc. The analysis of an inventory-control problem for 
the use of a digital computer includes the preparation of a complete 
report on all the processing requirements, process charts of the data flow, 
and definitions of the controls. These reports and charts correspond to 
the equations describing a scientific problem. 

Consider, for example, a concern that manufactures and sells certain 
products, and suppose that the company keeps the following files (see 
Fig. 1-12): (a) a customer master file that contains all customers’ names, 
addresses, purchases, balances due, credits, ete.; (b) a products inventory 
master file that contains the kind and number of each product in stock 
as well as its location, price, and other identifying descriptions and num- 
bers; (c) a factory parts and raw-materials inventory file that contains 
a list of all parts and materials stocked that enter into the manufacturing 
process, together with descriptive identifications and numbers; and (“) 
a suppliers master file that contains all suppliers’ names, addresses, sup- 
ply records, balances due, credits, orders not yet received, and so forth. 
The procedure by which the computer keeps such files up to date and 
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yenerates information from them will now be described; the reader should 
keep in mind that our intention is to give a gross idea of the problems 
involved, and we have hence oversimplified the circumstances. More 
yonlistically, these problems are characterized by huge masses of detailed 
eonsiderations. 

‘he customer file would be updated each time an order, payment, or 
yolurn was received from the customer. At the same time the computer 
would produce the necessary pricing and shipping information (i.e., cus- 
lomer discount, location, etc.) if an order was made, or the customer’s 
yolund order if a return was made. The products inventory file would be 
updated each time a report was received of shipping or completing a 
product. If a product was to be shipped, the computer would produce 
the product description number and warehouse location, and it would 
leo produce an order for more products to be manufactured if the ship- 
went reduced the inventory below a predetermined level. The computer 
man also produce the invoice for a sale from the proper information 
yocvived from the files. The file of parts and raw materials used in the 
iunufacturing processes would be updated each time a report was made 
of parts and materials used, returned, or received. The computer would 
ietermine what replacement orders and returns should be made as well 
is update the inventory. The supplier file would be updated each time a 
Will was received or report of return or receipt was made. The computer 
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> 2 would produce the payment orders to the supplier. Finally the computer 
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airline, every minute hundreds of inquiries about seats available, actual 
wiles, cancellations, and unsatisfied demands must be processed. Each 
\iwket office in the United States of a particular airline must be able to 
\nlerrogate the same centrally located computer concerning seats avail- 
able on particular flights and to make reservations on particular flights; 


(he processing of each query must be completed within a few seconds. 
aol|s 2l|s 4/(22 Esl] (ee The main advantages of such a system are that the customer is provided 
aa a) _ ¢ 2 . . . . . 4 . . . 
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A simplified reservation system is shown in Fig. 1-13. Each ticket 
allice contains a reservation interrogation and sales machine, which is 
sonneeted by transmission lines to a central service area. At the central 


5 2 25 3 3 25 service area, records are electronically stored on the availability and dis- 
a; a¢ $2 a8 jonition of all seats on all airplanes to be flown by the airline, for about a 
3% wg as ag month in the future. The processing of a reservation starts at the ticket- 

bes + atlive machine. Let us say that a customer desires two seats from 





Washington, D.C,, to Chicago on Apr. 30 on the 3 p.m. flight. The 
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sales clerk observes that Flight 83 leaves Washington, D.C., at 3 P.M. 
and goes to Seattle via Chicago, and chooses the card for Flight 83 from a 
file box. Each flight card contains information about two flights, one 
on each of the long edges. Along the edge are printed the scheduled 
stops, leaving a margin which is punched with a code representing the 
flight number. The clerk puts the punched margin of the card into a 
slot in the ticket-office machine, then presses buttons indicating the 
number of seats (two) and the date (Apr. 30) desired. The names of the 
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Fra. 1-13. Airline reservation system. 


cities on the card appear opposite a row of buttons on the machine, an 
the clerk pushes the buttons next to the names of the desired cities, Wash 
ington and Chicago. Then he presses the / (interrogation) button 
Meanwhile the remote scanner (see Fig. 1-13) is searching for a ticket 
office machine with a card in it. When one is located, the remot 
scanner connects the ticket-office machine with the computer. Th 
marginal punches on the card are sensed by the computer and tell i 
which flight is being interrogated; the date, number of seats, etc., are 
also transmitted to the computer. If two seats are available on Flight 8 
for Apr. 30, the computer transmits the information to the ticket-office 
machine by illuminating the proper lights on it. If the customer desires 
to make the reservations, the R (reservation) button is pushed and the 
computer subtracts two from the number of seats available on light 83. 
If no seats were available, the no-seats-available light would have been 
illuminated, Similarly a cancellation ean be made, and so forth, Of 





















filing system (see I*ig. 1-14). 
with individual physicians and hospitals within its area, receiving, trans- 
Milling, and computing medical information as required. 
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wourse our illustration has been greatly simplified, but the essential 
operations performed by the computer and the problems involved have 
hoon demonstrated. 

Aids to Medical Diagnosis. It has been recognized that electronic 
vomputers can aid certain aspects of medical diagnosis. Jor example, the 
‘omputer can (1) produce a list of possible diagnoses, consistent with 
inedical knowledge, for a given set of symptoms presented by a patient; 
(2) indicate further diagnostic tests which best differentiate between 
fomuining diagnostic possibilities; (3) calculate the probabilities for the 
llernate diagnostic possibilities; and (4) enable a more precise statement 
uid analysis of the value decisions which may be associated with treat- 
went planning. Such computer applications must be based on extensive 
medical data; hence a further use of computers is (5) to compile statistics 
(hat (a) relate symptom-disease combinations and that (b) evaluate 
(Iincuse-treatment-prognosis results. These statistics are then used in 
fiilure diagnosis, treatment, and prognosis determination. In addition, 
{he computer can serve as an indispensable aid, under certain special 
viveumstances: The computer (6) can enable criteria of a more quanti- 
fulive nature to be utilized in evaluating the results of certain diagnostic 


procedures (such as electrocardiograms, electroencephalograms, etc.) and 
mun perform the complicated calculations necessary for the proper inter- 
jrelation of certain clinical measurements. Finally, the computer can 


siwnilicantly aid the important aspect of information retrieval in, for 
pxumple, (7) the rapid retrieval of current information about new pre- 
yentive measures, diagnostic techniques, and specific treatments, and 
(8) the accumulation and retrieval of desired aspects of a particular 
jutient’s total medical history (such as total radiation dosage received, 
jwevious allergic reactions, individual biochemical and physiologic norms 
and deviations, ete.). 

Consider a modern high-speed electronic digital computer, and suppose 
that it has been programmed to aid medical diagnosis. Let us assume 
further that the physician can directly communicate with the computer 
iy telephone, teletype, radio, etc. The value of such a computer interro- 


Wiulion arises from three factors: (1) the ability of the computer to formu- 


lutea treatment plan that will maximize the chance of curing the patient; 
() the ability to determine the minimum number of necessary medical 


luhoratory tests or other diagnostic procedures for the particular patient; 
wud (3) the ability to evaluate more accurately diagnostic-test results 
fora particular patient based upon his previously recorded health records. 


\ network of such computers could form a hypothetical health-com- 
Here each computer can communicate 


The great sig- 
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nificance and importance of such a health-computer network cannot be 
overestimated as an aid to increasing individual good health and longevity 
and as a vast new source of medical information concerning mankind. 
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Fic. 1-14. Hypothetical health-computing system. 


1-6. Block Diagram of a Computer 


The purpose of this section is to acquaint the student briefly with the 
parts of a digital computer. To this end the block diagram of a computer 
will be developed in steps, from the simple to the more detailed. 

The Computer. As was seen in the previous sections, a computer 
handles numbers; it takes numbers that are fed into it and operates on 
them to form new numbers—the output numbers. Hence the simplest 
block diagram of a computer is as shown in Fig. 1-15. 


Input numbers Output numbers 


Fic. 1-15. Simplest block diagram. 


The computer can perform many different kinds of operations on num 
bers. Most of these operations combine two numbers to produce @ 
third number. The two numbers so combined are called the arguments 
of the operation, and the number produced is called the result. Many 
operations that a computer can perform are similar to the familiar oper- 
ations used in hand computation, such as adding two numbers to form the 
sum, multiplying two numbers to form the product, etc. Also analogous 
to hand computing is the fact that the computer can perform only one 
such operation at a time.t Hence, when many operations are to be 
performed, they must be performed in some appropriate sequence. 

The computer is told what operations to perform by means of instrue= 


+ Modern computers are being developed to perform several operations simul« 
taneously. 
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lions. An instruction not only contains information about an operation 

\o be performed but also designates the two arguments (usually) for the 

desired operation. When the operation specified by an instruction has 

heen completed by the computer, i.e., when the result has been obtained 

(he instruction is said to have been executed. 
l’or example, suppose that we wanted to compute the value of 


y = a(b +c) 


where a, b, and ¢ represent some specific real numbers. A sequence of 
jnatructions that would tell a computer to form a(b + c) is as follows: 








Operation | 1st argument | 2d argument Result 
Inwtruction 1..............{ Add b c x=bt+e 
Inmtruction 2............0. Multiply a x y = ax 





‘The computer would first execute instruction 1 and then instruction 2. 
The Memory and Computing Unit. The computer itself is composed of 
{wo main parts, a memory, which stores numbers and instructions, and a 
computing unit, in which the actual computations take place (see Fig. 
\-16). The computer’s memory stores the numbers to be operated on; 






Computer 
memory 


Input numbers Output numbers 


Computing unit 


Fig. 1-16. Memory and computing unit. 


it stores intermediate results that are generated during the course of a 
computation; and it stores the final results. For instance, the numbers 
represented by a, b, and c of our example are initially stored in the 
memory. After the first instruction is executed, the intermediate result 
» is stored in the memory. After the second instruction is executed, the 
{inal result y is stored in the memory. 

The instructions themselves are also stored in the computer’s memory. 
ach instruction is transmitted to the computing unit when it is to be 
executed, In our example, instruction 1 is first transmitted to the com- 
puting unit and executed, and then instruction 2 is transmitted to the 
computing unit and executed. 

he reason for having an internal memory in the computer is that 
numbers and instructions can be transmitted between such a memory 
and the computing unit more rapidly than by other means. This enables 


Bais 
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the computations to proceed more quickly. There are high-speed com- 
puters in which a number or instruction in the memory can be trans- 
mitted to the computing unit in as short a time as 1 usec. On the other 
hand, arithmetic operations such as addition can be performed in less 
than 1 usec. Hence, it is generally the speed of the memory that limits 
the speed of a computer. 

It is seen from the above discussion that the computing unit has two 
functions: it must (1) obtain instructions from the memory and interpret 
them, as well as (2) perform the actual operations. Hence it is com- 
posed of two parts: the control, concerned with the former, and the 
arithmetic unit, concerned with the latter function (see Fig. 1-17). 






Computer 
memory 


a - 
Arithmetic Control 
unit 


Fig. 1-17. Arithmetic unit and control. (Solid-headed arrows indicate information; 
hollow-headed arrows, control signals. ) 
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Analogy between Hand Computation and the Functions Performed by the 
Parts of a Digital Computer. A general rule for evaluating the capabili- 
ties of a digital computer is this: If the steps in the solution of a problem 
can be broken down into sequences of unambiguous instructions that con- 
ceptually could be performed by a very patient secretary who has no 
knowledge of the subject matter but who has infinite perseverance and 
can follow instructions precisely, never making an error, then the prob- 
lem can be solved on a digital computer. This rule is an oversimplifi- 
cation only in so far as the necessary computation time on the computer 
and the cost of this time are also restrictions. It is instructive to carry 
our analogy still further, describing the functions of the various parts of 
the computer (see Fig. 1-18). The computer memory is analogous to the 
worksheet and instruction list of the hand computation: the memory 
stores the initial data to be operated on, the intermediate and final results, 
and also the instructions themselves. The control interprets the instruc- 
tions regarding the operation to be performed, the arguments to be oper- 
ated upon, and the instruction to be chosen next; this is analogous to the 
computing secretary’s mind. The actual operations indicated by the 
instructions are performed in the arithmetic unit, analogous to the secre- 
tary’s desk calculator. 

The Control. The control itself must perform two functions. It must 
(1) interpret the instruction; then, on the basis of this interpretation, it 
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must (2) tell the arithmetic unit what to do. The latter function is 
accomplished through the use of electronic control signals. 
In accordance with these two functions, we can separate the part of 


Se 






\ 
Input numbers 1 
and instructions | 


Arithmetic unit 





Wig, 1-18, Analogy between manual computing and an electronic digital computer. 
I'he arrows indicate the flow of information and control signals. 
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Fig. 1-19. Instruction decoder and control generator. 


Arithmetic 
unit 





(he control that interprets, or decodes, the instructions, called the énstruc- 
fron decoder, from the part that generates the control signals, called the 
sontrol generator (see Fig, 1-19). 

After an instruction has been transmitted to the instruction decoder, 
where it is interpreted, the control generator senses this interpretation 
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and then produces signals that tell the arithmetic unit which operation 
to perform. It also generates signals that choose the proper numbers 
(arguments) from the memory and sends them to the arithmetic unit 
at the proper time; and when the operation has been performed, still 
other control signals take the result from the arithmetic unit back to the 
memory. After an instruction has been executed, the control generator 
produces signals that enable the next instruction to go from the memory 
to the instruction decoder. In this way the instructions are performed 
sequentially. 





Control Instruction 
generator we decoder 


4 








Fic. 1-20. Input-output. 


Input-Output. There are many means by which numbers and instruc- 
tions can be initially introduced into and finally read out of the computer 
memory. They can be typed directly into the computer on specially 
adapted electric typewriters, read into the computer by means of punched 
paper tape on a teletypelike machine, punched on cards and read into 
the computer by means of a punched-card machine, memorized on mag- 
netic tape and read into the computer by a specially built tape recorder, 
etc. When the computations have been completed, the resulting num- 
bers can be read out of the computer by having the computer operate a 
typewriter, punch paper tape, punch cards, put pulses on magnetic 
recording tape, ete. Most computers have several such input units and 
output units. The in-out selector determines which unit will read infor- 
mation into or out of the computer; it is controlled by signals from the 
control generator (see Fig. 1-20). 

Summary. The computer’s memory stores the instructions and the 
initial, intermediate, and final numbers involved in a computation. 
Instructions tell the computer what computations to perform, An 
instruction includes information about the kind of operation to be per- 
formed and the arguments involved in the operation, 

The instruction decoder interprets an instruction, This interpretation 
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ix sensed by the control generator. On the basis of that information, the 
control generator produces control signals that transmit the proper 
numbers from the memory to the arithmetic unit and that tell the 
arithmetic unit what operations to perform on these numbers or argu- 
ments. When the results have been computed by the arithmetic 
nit, the control signals guide them back to the memory. Then the 
control signals bring the next instruction into the instruction decoder, 
and the cycle continues. When new data is to be read into the memory 
hy means of the input units, the control generator directs the in-out 
welector to choose the proper input unit. The control generator takes 
similar action when computed data is to be read out of the memory 
hy means of one of the output units. 


1-7. Functional Description of a Computer 


The functional description of a computer involves consideration of 
further details about the computer’s memory and the relation of an 
instruetion to the memory. Two fundamental concepts of modern 
digital computers are involved in this relationship. 

Words. The memory of a computer is composed of a large number of 
memory boxes, analogous to mailboxes. The information, or items to be 
iemorized, is placed into these boxes as mail is placed into mailboxes. 
Mach memory box, like a mailbox, has a name, or address. These 
addresses are usually numbers. For example, if the computer contains 
100 memory boxes, their respective addresses might be the numbers from 
| to 100 (or O through 99). The information or items put into the 
memory boxes are called words and are analogous to the mail itself. 
However, only one word can be stored in a memory box at a time, As was 
acon in Sec. 1-6, there are two basic types of information, or words, that 
ean be put into the boxes: words that are numerical quantities and words 
(hat are computer instructions. 

At this point a short digression on computer terminology will prove 
helpful. The memory box of the previous paragraph is more correctly 
called an address, or alternatively a memory address, memory location, 
storage location, or cell. Instead of saying, ‘‘A word is in a memory 
hox,”? computer personnel say, “The contents of an address is a word.” 
Instructions are sometimes called commands; quantities are sometimes 
culled data. t 

An instruction word looks like a number, and there is no way to tell 
from the word itself whether it is a quantity or an instruction. The 
vomputer must be told explicitly which addresses contain instructions 

{ Seo First Glossary of Programming Terminology, J. Assoc. Computing Machinery, 


June, 1954; IR, Standards on Electronic Computers: Definitions of Terms, 1956, 
Proce, TRE, vol, 44, no, 9, September, 1956, 





32 MACHINE LANGUAGES {CHap. 1 


and which contain quantities. Ordinarily a word is treated by the 
control as an instruction and by the arithmetic unit as a quantity. The 
fact that it is only the interpretation of a word that distinguishes instruc- 
tions from quantities is the first fundamental concept of modern digital 
computers. This is fundamental because it enables the computer to 
operate on instructions. That is, the computer itself can formulate or 
change an instruction by treating it as though it were a quantity; and 
then, at a later time during the computations, this changed instruction 
can actually be executed. This gives the computer the capability of 
actually writing instructions as well as executing them; i.e., the computer 
can “tell” itself what to do—which is the closest that a computer proba- 
bly ever comes to “thinking.” More will be said about this in Part 2. 

Instructions and Addresses. To see the relationship between an 
instruction and the computer’s memory, consider, for example, an instruc- 
tion that tells the computer to add two numbers. The instruction does 
not directly tell the computer to add the numbers themselves but rather 
tells the computer to add the numbers that are found as the contents of 
addresses specified in the instruction. If the numbers to be added are 
located in address a and address 8, the instruction literally says, ‘“Add 
the contents of address a to the contents of address B.” Of course, the 
proper numbers to be added must have been previously placed into the 
addresses aand B. The same instruction might go on to tell the computer 
to put their sum into address y and that the next instruction will be found 
as the contents of address 5. Hence, after our add instruction has been 
executed, the number that is the sum will be found as the contents of 
address y. It must be certain that the contents of address 6 is actually 
an instruction and not a number because the contents of address 6 will be 
interpreted as if it were an instruction in any case. 

The fact that instructions deal directly with addresses whose contents 
are to be operated on, rather than with the numbers themselves, is the 
second fundamental concept of digital-computing machines. This gives 
the computer an advantage that is exactly analogous to the advantage of 
performing algebra on symbols x, y, - - - - 

As an example of an instruction, suppose that we wanted to add 125 
and 412 and that we put 125 into address 32 and 412 into address 34. 
Suppose that we want the sum to appear in address 36 and that the next 
instruction is the contents of address 41. Then our add instruction, 
when decoded, would say, ‘‘Add the contents of address 32 to the con- 
tents of address 34, and put the sum into address 36; then the next 
instruction will be found as the contents of address 41.” 

The Accumulator. Consider now what happens in the arithmetic unit 
while an instruction is being executed, In most computers only one 
word at a time can be transferred between the arithmetic unit and the 
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memory. Hence, to perform an operation involving two arguments, the 
first argument must be transferred from the memory to the arithmetic 
unit and stored there temporarily while the second argument is being 
(runsferred. The special memory cell in the arithmetic unit for this 
purpose is called the accumulator (see Fig. 1-21). When the operation is 
performed, the result is formed in the accumulator before it is transmitted 
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lia. 1-21. Accumulator, instruction register, and current-address register. 


‘Thus, in the above example, the contents of address 32, namely, 125, 
would be brought into the accumulator first; then this would be added 
(0 the contents of address 34, namely, 412, and the result, namely, 537, 
would be formed in the accumulator. Then 537 would be transmitted 
from the accumulator into address 36. 

The Instruction Register and the Current-address Register. Next con- 
sider the instruction decoder that interprets an instruction. In order 
(hat the instruction decoder perform its function, it must constantly 
refer to the instruction being interpreted during the time control signals 
ave being set up. To facilitate this, while an instruction is being exe- 
wiuted it is stored in a special memory cell, called the instruction register, 
located in the instruction decoder (see Tig. 1-21). 

‘There is another special memory cell located in the instruction decoder, 
eulled the current-address register (see Fig. 1-21). The contents of this 
hogiater is (nearly) always the memory address from which the instruc- 
(ion being executed came. The reason for this is related to the fact that 
the address of the present instruction was given as part of the previous 
jnatruetion, 
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For example, consider the add instruction described above. Suppose 
that this instruction were originally the contents of address 40. Then, 
while it is being executed, the contents of the instruction register will 
be the add instruction itself, and the contents of the current-address 
register will be 40. After the add operation has been executed (i.e., 
after the sum of 125 and 412 has been put into address 36) and just 
before the next instruction is read into the instruction register, the address 
of the next instruction, namely, 41, is recorded in the current-address 
register. 

The Four Phases. The functioning of a computer during the execution 
of an instruction is often summarized in terms of four phases. Assuming 
that an instruction has already been transmitted into the instruction 
register, phase 1 involves the transmitting of the first argument from the 
memory into the accumulator. During this phase the instruction decoder 
determines the address of the first argument from the instruction, and 
the control generator produces control signals that transmit the contents 
of this memory address into the accumulator. During phase 2 the second 
argument is brought to the arithmetic unit and the operation is per- 
formed, Both these functions are included in the same phase because 
most often the operation is performed as the second argument enters the 
arithmetic unit. In this phase the instruction decoder determines from 
the instruction the address of the second argument and what operation 
is to be performed; the control generator generates signals that set up the 
arithmetic unit to perform this operation and signals that transmit the 
contents of the address of the second argument into the arithmetic unit; 
the arithmetic unit performs the operation, generally leaving the result 
as the contents of the accumulator. During phase 3 the contents of the 
accumulator is transmitted into the memory address specified by the 
instruction. During phase 4 the address of the next instruction is ascer- 
tained, and the contents of the current-address register is changed to the 
new address; the contents of this new address is then transmitted into 
the instruction register as the next instruction. Then phase 1 for this 
instruction is initiated, and the process is repeated. 

It should be observed that computers are often designed with phase 1 
or 3 omitted. That is, in some computers the memory address of both 
arguments need not necessarily be specified in a single instruction, nor 
need the memory address into which the result is transmitted always be 
specified. In such computers multiple instructions would be equivalent 
in function to a single instruction as described above. This aspect of 
instructions is discussed in considerable detail in Chap. 3, and we shall not 
press the point here, except to note that, although the principles described 
hold in general, the specific details vary with different computers. 
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Summary. There must be an instruction in the instruction register 
initially so that the whole process may begin, and there must be a way 
to stop the computer. These details will be considered in Chap. 3. 
he important points we wish to emphasize in this section are these: 
(1) instructions and quantities are memorized by the computer as the 
contents of memory addresses; (2) they are indistinguishable from each 
other except by interpretation, and (3) instructions explicitly involve only 
addresses and tell the computer what to do with the contents of these 
addresses; (4) the functioning of a computer during the execution of an 
instruction can be summarized in terms of the four phases as described. 


1-8. Words and Pulses 


Binary Numbers and Bits. Words are memorized in the computer as 
binary numbers. Hence it is necessary to divert our attention for a 
moment to a description of the binary notation for numbers. Numbers 
written in the binary system make use of only the symbols 0 and 1 
walled bits, just as numbers in the decimal system use the symbols 0, 1 2. 
i, 4, 5, 6, 7, 8, and 9, called digits. We say, for example, that 1101 a ‘ 
hinary number composed of 4 bits, just as 8,675 is a decimal number 
vomposed of 4 digits. In the decimal system the number 8,675 really 
means (8 X 108) + (6 X 102) + (7 X 10") + (5 X 10°), where 10° = 1; 
inulogously, in the binary system the number 1101 really means (1 X 23) 
} (1 K 27) + (0 X 2") + (1 X 2°), where 2° = 1. Since 2% = 8, 2? = 4 
Q! = 2, and 2° = 1, 1101 is (1 x 8) ++ (1X4) + 0X2) +0 X1 = 13 
in decimal. A more general and thorough discussion of the binary- 
Mumber system appears in Chap. 2. Our purpose here is simply to show 
(hat binary numbers are made up of bits, 0 and 1, and that a binary num- 

hor is just another notation for our familiar decimal numbers. 

i ords as Pulse Trains. As can probably be foreseen by the reader 
al this point, the reason for using the binary numbers in digital computers 
in to enable the use of simplified electronic circuitry. For then only two 
signal voltage levels need be distinguished, the voltage level that is to 
represent 1 and the voltage level that is to represent 0. A word is memor- 
iwod by the computer as a series, or train, of 1 and 0 signal voltage levels 
often called pulses, that represent the bits of the word. The number of 
bite ina single word is called the length of the word. All the words within 
i particular computer are of the same length;+ hence, so are the contents 
vf ull addresses in the computer memory. When the proper signals are 
generated by the instruction decoder, the train of pulses that represents 


! Modern computers are bein i 
/ g developed to use different word lengths in di 
seotions of the machines, penny non 
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a word goes from the memory address to the arithmetic unit. In some 
computers this is accomplished in a serial manner in time; i.e., the pulses 
flow past a fixed point at some fixed rate, say 1 pulse/psec. The speed 
of a computer is usually denoted by this pulse rate, or equivalently by 
the pulse frequency, i.e., 1 megacycle for our illustration. ; 

It is often important to be able to denote a particular bit in a word 
by means of its position in the word. For this purpose we number the 
positions from right to left. Then Pn represents the nth bit of a word. 
For example, consider the 30-bit word in Fig. 1-22. Here Pl = 0, 
pP2=0,..., P6=1, P7=9, PRE 1 3 by POS APS Dy 
etc. 


ROW EMA AOD ORL HEM ATO ANN OI MONG 
AANA ANN NANA detest tee aa 

A Ay Ay AY AGAY AY AYAY Ay Ay Ay AY AY AY AyAYAY AYALA, AY AYA, AYALA, 
011011010011100011110 100000 
Fic. 1-22. Example of a 30-bit word. 


Time Synchronization. In the arithmetic unit two word pulse trains 
are operated on by electronic circuits to form a new word pulse train that 
represents the result of the operation. In the control the word pulse train 
is decoded and interpreted as an instruction by other electronic circuits. 
The circuits that accomplish these functions are called gates, or digital 
switching circuits. Everything that happens to a word during the four 
phases takes place in these switching circuits, or gates. 

It has been implied in the above discussion that there is some kind 
of synchronization of the pulses in a computer. In most computers 
this is indeed the case: all pulses are synchronized with respect to each 
other by means of clock pulses generated by a clock within the computer. 
The clock is an oscillator, with a frequency that is usually some integral 
multiple of the speed of the computer. Every gate is so connected to 
the clock of the computer that pulses are allowed to enter or leave the 
gates only at those times at which a clock pulse is received. The effect 
is analogous to an orchestra where the conductor beats out the timing, 
like the clock—the individual musicians, the gates, play notes, the pulses, 
according to this rhythm as directed by their music sheets, the design 
of the gating connections. At each beat of the clock all the pulses 
advance in the computer according to the gating connections, just as the 
musicians go on to the next musical beat. The purpose of this is to 

keep the phases of the pulses coordinated. 

Example of Adding. Let us consider what happens in the arithmetic 
unit when two numbers are added, To be specific, suppose that these 
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itmbers (where An and Bn label the bit positions) are 


RES & = ‘ 

R28 NR2 ARN HRA SNS Bee Ange omn eet san 
— SES SSS S55 S55 S55 eeq qeq x34 
000 010 110 000 001 111 111 001 110 011 
RTP HOW YON BOD on 

M IN NAN NAN NAS = ~ ex galboct Spe O20 oe 

SS Sh AHS RRR RRR SER SRR ay mere een 
000 001 011 101 110 101 111 010 100 000 


( he decimal equivalents are, for A, 46,202,483 and, for B, 24,600,224 
Moe ( hap. 2 for decimal-to-binary conversion methods.) siippene ‘that 
we are in phase 2; i.e., the first argument has already been irangrnttted 
into the accumulator. Suppose also that the pulse rate of the comput 

in | usec; then the bits advanced once each microsecond. Fi ea 93 
shows the situation at the start of phase 2; Fig. 1-24 shows eticaton 
4 ines later Fig. 1-25 after 2 usec; Fig. 1-26 after 10 ysec; Fig. 1-27 sitar 
4) usec; and Fig. 1-28 after 30 usec. As the two arguments advance 
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Fia. 1-23. Start of phase 2. 
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Fia. 1-24. One microsecond after start of phase 2. 
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Fig, 1-25, Two microseconds after start of phase 2. 
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bit by bit into the adder, the result is formed bit by bit; the partial results 
are stored in the accumulator behind the partially used first argument as 
shown. The double line separates the partial first argument from the 
partial result so far formed in the accumulator. (The partial result is 
to the left of the double line; the remaining first argument is to its right.) 
In Fig. 1-28 the complete result has been formed in the accumulator, 


namely, 


RAQ RAK BRR RRR RRN RRQ RRM” MRR RRR RRNMN 
000 100 001 110 000 101 110 100 010 O11 


(which in decimal is 70,802,707). The computer is then ready to start 
on phase 3. 

Figures 1-23 to 1-28 illustrate what would happen in a so-called “serial’’ 
computer. However, there are other, “parallel” methods for adding, in 
which all the digits of the sum are formed at the same time. Our purpose 
here was merely to give a general idea of how word pulse trains travel 
through gates in a synchronous fashion during a computation. 
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Fig. 1-27. Twenty microseconds after start of phase 2. 





Accumulator DANN NNN gates 


(esa ofoesIsrofeoelroalsrelzeoTe toes 


Fig, 1-28. Thirty microseconds after start of phase 2. 
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Instruction Decoding. We have given an example of how numerical 
(\uantities are gated. Next let us illustrate the method by which an 
\imtruetion word is written and decoded. Suppose that the words of a 
l\ypothetical computer are each 30 bits in length, as above. Let the 
loftmost six bits, P30, P29, P28, P27, P26, P25, represent the operation 
wode, ie., tell what the instruction is to do: add two numbers, or form the 
product of two numbers, ete. Suppose, for concreteness, that the 
binary number 101 011 is the code for ‘‘add” (see Fig. 1-29). Let 


oO wo st an oO am N n~n wo ct 
~m NN a et a 
A AQ A AQ AQ a A Q 
101 011 011 010 011 100 011 110 100 000 
Operation The address The address The address The address 
code of one of the other into which the of the next 
operand operand result goes instruction 
Fig. 1-29. Example of an instruction word. 
jhe next six bits, P24, ... , P19, represent the address of one of the 
\umbers to be added; P18, . . . , P13, the address of the other number; 
and P12, ... , P7, the address in which the sum is to be stored. The 
junit six bits, P6, . . . , P1, tell where the next instruction is located in the 


jwomory. When the instruction word goes into the instruction decoder, 
\i is stored in the instruction register. This register has a circuit for 
wach bit being stored, with an output wire so that, if a 1 is stored, the 
vireuit puts out the 1 signal voltage level on the wire, and the 0 signal 
voltage level if a zero. The outputs on these wires are sensed by gating 
vireuits, and the instruction is thus decoded (see Fig. 1-30). 

The control unit generates control pulses which are also binary in 
haracter, i.e., are composed only of the 0 and 1 voltage levels. In fact, 
almost all the circuitry in a digital computer is concerned with only these 
wo voltage levels. This distinction can be made as to the function of 
jiilxes in the computer: some pulses represent words (numbers or instruc- 
ions); other pulses represent control signals. Such a distinction was 
made in the block diagram of a computer, where solid-headed arrows 
ywpresented the flow of word pulses and hollow arrows represented the 
flow of control-signal pulses. 


TT 


meal en 

To control unit for: To control unit for: To control unit for: To control unit for: 

arithmetic unit choosing proper choosing proper choosing address of 
control addresses of operands address of result —_ next instruction 


ia, 1-30, Instruction register, 
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1-9. Input, Output, and Memory Systems 


Input Information. As we have seen, there are many ways to get 
information into and obtain information from a digital computer. In 
order to give a more concrete picture of the process, we shall describe 
in some detail the procedures a person would follow in order to put 
information into a computer. The input information usually consists 

of a list of numbers and a list of instruc- 

‘ili tions telling the computer what operations 

0 00| to perform on these numbers. For exam- 

oO o|. ple, a problem might be to compute the 

0 o| trajectory of a missile (i.e., to construct a 
00 table giving the position of the missile at 
o| each second during its flight) given its initial 
000|_ Velocity and angle of take-off. Suppose 

O that the equation of the trajectory is 


[oe r= (Vo cos yt y= (Vo sin 6)t _ logt 


One word 











Electromechanical 
part of computer 
that looks for holes 





where 2,y = coordinates of missile at any 
time 
Vo = initial velocity 

angle of take-off from hori- 
zontal 

Among the list of numbers that must be 
entered into the computer are the values 
of Vo, cos 6, sin 6, g, and the constant 1 
(the time interval of 1 sec). A list of in- 
structions must also be read into the com- 
puter telling it how to compute x and y. All these numbers and instruc- 
tions must be entered into the computer in binary form. The first 
problem under consideration in this section is just how this might be 
accomplished. 

Suppose that punched paper tape is to be used for this process. An 
electromechanical device exists so that a hole in the tape is interpreted by 
the computer asa 1, no hole asa 0. Consider a paper tape with positions 
for three holes per line. If there are 30 bits to a word, then 10 lines are 
needed per word. Figure 1-31 illustrates how the following word would 
look on a tape: 


111 101 011 110 O01 111 O10 110 O11 001 


Il 


Next word |O O O 


Fia. 1-31. Punched paper tape. 


One method for getting the proper holes in the paper tape is to use a 
hand paper-tape punch machine. This manual punch machine has keys: 
pushing a single key will punch one line. The keys might be numbered 
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from 0 to 7, and the array of holes corresponding to each key might be as 
wiven in Fig. 1-32. Hence the word above would be punched by pushing 
(he following keys in succession: 


7, 5, 3, 6; 1, 7, 2, 6,3,1 


‘lhe person who prepares the list of numbers and instructions for a prob- 
lom is called a programmer, or coder, and the list itself is called a coding 
sheet. The code is written as a list of computer 


words. However, the coder does not write the Key 

words in binary form but instead puts down, as a (0) 

shorthand, the numbers of the keys that are to be 

juinched. Hence instead of the word above, the o . 

woding sheet would have = S 
(3) [oxe) 

75 36 17 26 31 (4) |0 

The punch operator would push these keys in suc- er (O° 9 

Wonsion, and the paper tape would be punched in a (6) joo 

hinury mode. After the punching is complete, the Mm jO0°0 


jiper tape is run through the computer paper-tape 
jonder, which translates the holes into the units and 
yoros that are electronically stored in the comput- 
o's memory. The computer then calculates the 
fonults, and they are read out—perhaps by having the computer punch a 
jiper tape. This paper tape would then be put into a machine that does 
(he reverse of punching, i.e., that looks at a paper tape and prints on 
paper the numbers that appear there, in a manner similar to a ticker-tape 
Machine. 

Other Input-Output Methods. Besides punched paper tape there are 
inched cards: the cards are punched in much the same manner as the 
juper tape, and the device that reads the cards into the computer likewise 
wnses the holes and interprets these in terms of electronic pulses. A 
\ypewriter may be rigged with relays so that when a key is pushed a code 
is produced by the relays and transmitted directly into the computer. 
Much methods are often found too slow. <A faster method for putting a 
jrogram or code into a computer is by means of magnetic tape, like sound- 
icording tape (see Fig. 1-33); here a small magnetized area on the tape is 
jnlorpreted by the computer as a unit, no magnetized area as a zero. 
llowever, before this magnetic tape is read into the computer, the pro- 


Fig. 1-32. Punched- 
paper-tape code. 


yam must be first “written” onto the tape. This may be accomplished 
#0 & machine not associated with the computer itself, often called an 


"\nseriber,”’ There are many varieties of inscribers; some enable the code 


fo be put on the magnetic tape directly from a typewriter, others from 
jreviously punched paper tape or cards, lor example, one procedure 
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(a) 





(b) 


Fig. 1-83. (a) Magnetic-tape unit; (b) magnetic tape, showing locations of bits, 
(Photographs courteay Potter Inatrument Company, Ine.) 
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muy require that the code first be punched on paper tape and the paper 
iupe “inscribed” onto magnetic tape, which is finally read into the com- 
puter. Table 1-1 gives some idea of the speeds in bits per second at which 
\nformation can be transformed from one medium to another. Knowing 
ihe number of bits per word, one can then calculate the number of words 
per minute that can be so transformed. 


TaBLE 1-1. ORDER OF MAGNITUDE OF RATES 














From To Speed 

Papor tape................. Computer memory 2,000 bits/sec 
Wunched cards.............. Computer memory 3,000 
Typowriter...............-. Computer memory 40 
Magnetic tape.............. Computer memory 100,000 

Vapor tape..............06- Magnetic tape 2,000 
Wiunched cards............-. Magnetic tape 3,000 
Typewriter.............2.-- Magnetic tape 40 
Computer memory.......... Paper tape 300 
Computer memory.......... Punched cards 2,000 
Computer memory.......... Magnetic tape 100 , 000 
Computer memory.......... Typewriter 600 characters/min 
Paper tape..........-.....- Typewriter 600 
Wunched cards.............. Printer 10,000 
Magnetic tape.............. High-speed printer 100 , 000 
Computer memory.......... High-speed printer 100,000 
Computer memory.......... Cathode-ray-tube printer | 500,000 








There are also some unusual means for putting information into the 
womputer. One is a reading machine that can translate digits and letters, 
us they appear on a printed page, into a binary code and write this code 
jnto the memory. 

Most of the methods for taking information from the outside world 
jnlo the computer’s memory can be reversed to serve as methods for 
wading information out from the computer’s memory to the outside 
world in a form that is readily understandable. Hence the computer 
wan work a typewriter, punch paper tape, punch cards, or write onto 
inugnetic tape. Of course the ‘“‘outscriber’”’ is required to transform the 
information from punched paper tape, punched cards, or magnetic tape 
into the letter and number characters we are more familiar with. In 
addition there are methods using so-called “high-speed” printers and 
specially made cathode-ray tubes (see Figs. 1-34 to 1-36) that like the 
iypewriter display numbers and letters directly from the computer. 

Other outputs from a computer may be obtained from attaching 
oncilloscope, or television, tubes to the computer’s memory, Then the 
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computer literally can draw maps, can indicate the movements of dots 
representing aircraft, etc., or can display other kinds of pictures. 
Memory Systems. Words read into the computer go into the com- 
puter’s memory. The function of a computer memory is to store the 
pulses of each word in such a way that they may be called into the arith- 
metic unit rapidly, upon generation of the proper control signals. In 





Frc. 1-34. High-speed printer showing type-face drum. As the drum rotates, @ 
hammer pushes the paper up against the proper letter in each column. In this way 
an entire line is printed at one time during each revolution of the drum. (Photograph 
courtesy Potter Instrument Company, Inc.) 


addition, the computer memory must be able to accept rapidly the pulse 
train of a word for storage. The speed of modern computers is essentially 
limited by the speeds of access to their memories. 

An important point that is true for practically all kinds of memories 
is that when a word is read out of a memory the word is still stored in the 
memory (unless otherwise specifically stated by the instruction—see 
Chap. 3). On the other hand, when a word is written into the memory 
from the arithmetic unit, the previous word of that address is automatically 
erased and replaced by the new word, 
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There are many ways of memorizing words in a digital computer. 
Magnetic tape, described above as an input-output system, can as well 
be used for a computer memory. A similar magnetic-pulse technique is 
incorporated into magnetic drums, which are simply rapidly rotating 





Wu, 1-85. Sample print-out from a high-speed printer. This printer uses a type-face 
hain, as shown, instead of a drum. The chain moves horizontally behind the paper, 
which moves vertically. When the proper character is aligned with a given position, 
« hammer strikes an inked ribbon against the paper and type face. (Photograph 
rourteay International Business Machines Corp.) 


vylinders with magnetizable surfaces (see Fig. 1-37). Faster memories 
are constructed from magnetic cores, specially prepared from ferrites 
lwving a square hysteresis loop, thereby presenting two stable states. 
ach core then represents one bit of a word (see Fig. 1-38), An even 
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Selenium- Charactron tube 3} + y os y 

Display-positioning coated ti > x *4 
‘ electrodes drum Lens é of? 'e it a ; 
Character-selecting TL ,Drum surface charger t iF; @ Na 

electrodes witA <?, fe _ ‘a 

Ink powder Y __ Brushing off surface 8, GS. e ¢ 
ase € A i 
4 Heating 


Character ; 
matrix Printed 


(a) 


(b) 


Fic. 1-36. The Charactron tube coupled with the xerography process for high-speed 


rinting. (a) The cathode-ray tube, showing the character-selecting electende srs 
bead the electron beam through the proper hole in ~ Lapiriaapree rey cc fees 
i frie, itizes the selenium-coa . 
light image from the cathode-ray tube sensitizes ur 
oe eS transfer the ink powder, which sticks to the sensitized area of the paper. 





lia, 1-38. Magnetic cores. (Photograph courtesy International Business Machines 
Corp.) 


faster memory has been constructed, using diodes and capacitors. The 
memory with the fastest reaction time is one constructed using a type 
of gating circuit, called a “flip-flop,” to record each bit. The time re- 
quired to transmit one computer word out of the memory to where it 
will be used is called the memory access time (see Table 1-2). 


TABLE 1-2. OnpER oF MAGNITUDE OF Memory Access TIME 


Memory system Time required to transmit a 30-bit word 
Magnetic tape............. 5 msec +,time necessary to position tape 
Magnetic drum............ 10 msec 
Acoustic delay line......... 100 usec 
Magnetic core............. 5 usec 
Diode capacitor............ 1 wsec 
Flip-flop register........... 1 psec 


In the magnetic-tape memory the whole tape must be searched, and a 
xreat deal of time is lost just winding and rewinding tape. For the drum, 
the computer must wait for the word to rotate to the proper read-out 
position, The magnetic-core arrangement enables any word to be read 
out with no delays except those inherent in the circuitry involved. A 
word is read from a coré memory in parallel; i.e., all the bits are trans- 
mitted to the arithmetic unit at the same time, <A similar situation 





Pia, 1687, Magnetlo drum, (Photograph courteay Royal MeRee Corp.) 
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occurs with diode-capacitor memory or with a memory composed of a set 
of flip-flop registers. 

Unfortunately, the faster the memory, the more it costs per bit. 
Hence a computer usually has several kinds of memory. Most com- 
monly, magnetic tapes and drums are used for the so-called “low-speed” 
memory of a computer, while a magnetic-core memory or diode-capacitor 
memory is used as the “high-speed”? memory. Often there is also 
included in the larger computers a battery of “flip-flop” registers for the 
really high-speed work. In this way, various parts of a large program 
can be stored in the relatively low-speed memory system of a computer 
and then transmitted to the high-speed memory when the actual compu- 
tations of this part of the program are to occur. 


1-10. Exercises 


1-1. What are the general purpose and function of an operation? An instruction? 
The computer memory? The computing unit? The arithmetic unit? The control? 
The instruction decoder? The control generator? Control signals? Input units? 
Output units? The in-out selector? 

1-2. If 2 and 3 are added to obtain 5, what are the arguments of this operation? 
The result? 

1-3. Write a sequence of instructions by means of which a(b + cd)/f can be com- 
puted. Letting a = 6, b =4,c¢ = 2, d =3, and f = 5, perform the sequence of 
instructions. 

1-4. Write a sequence of instructions by means of which (1 — a?)* can be computed. 

1-5. What four functions do the control signals perform? 

1-6. Discuss the first two fundamental concepts of modern digital computers, using 
accepted computer terminology. 

1-7. Consider the example of Sec. 1-6. Suppose that the value of a is found as the 
contents of address 50, of 6 as the contents of address 51, and of cas the contents of 
address 52; the value of the intermediate result x is to be put into address 53, and the 
final result y is to be put into address 54. How would instructions 1 and 2 of that 
example be written if these instructions themselves were found as the contents of 
addresses 55 and 56, respectively? 

1-8. Describe by naming specific addresses and special memory cells what happens 
during each of the four phases during the execution of the instruction given as an 
example on page 32. What is the contents of the instruction register? What is 
the contents of the current-address register? What is the contents of the contents of 
the current-address register? (Hinr: The contents of the current-address register is 
an address itself.) 

1-9. During phase 1 what is the relation between the contents of the instruction 
register and the contents of the current-address register? 

1-10. Define: length of word; pulses; speed of computer; gates; clock pulses. 

1-11. In the addition example given in Sec. 1-8, what is the contents of the accu- 
mulator 15 usec after the start of phase 2? 25 usec after? 

1-12. What keys of a paper-tape puncher would be pushed to punch the number 


101 110 O11 001 101 101 O11 O11 001 100 


(Use the table of Fig, 1-32.) 
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1-13. Draw a picture of paper tape with the number in Exercise 1-12 punched on it- 
1-14. What number does the hole pattern of Fig. 1-39 represent (left to right)? 


[e) (out e) (onze) OO 
(onze) oo 1e) (ome) 
O =O ie) 
Fia, 1-39 
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CHAPTER 2 


PRINCIPLES OF PROGRAMMING 


Q-1. Introduction 


The purpose of the current chapter is to introduce the student to the 
fundamental principles of coding and programming, as opposed to the 
lactics of coding and programming. These principles are concerned with 
sequences of instructions, the coding of decisions, the flow-charting and 
vymbolic-coding aids, and the use of loops and instruction modification. 
Ii order to leave the student free to concentrate on these principles, the 
four-address format, with a minimum number of instruction types, is 
\tilized. This serves to separate consideration of the principles from 
(hat of the tactics. However, it should be pointed out that the four- 
address format is used in this chapter for pedagogical reasons only. 
Discussions of instruction formats in actual use on currently available 
vomputers, and of the large variety of instruction types utilized in actual 
practice, are associated with the tactics of coding and programming; these 
will be considered in detail in Chap. 3. 

The terms coding and programming are often used as synonyms. 
However, a code is more specifically a short list of instructions that direct 
(he computer to perform only a part of the entire calculation, whereas the 
lorm program refers to the complete list of instructions used for the 
problem. Hence the term programming usually includes the over-all 
plunning of the use of the computer for a particular problem as well as 
(he writing of the instruction lists, or codes, whereas coding is usually 
limited in meaning to just the writing of the instructions lists. Some- 
limes a code is called a routine. 

Stages in Programming. As will be seen, there are five stages in pro- 
“ramming. First the computations to be performed must be clearly and 
precisely defined. The over-all plan of the computations is diagramed by 
moans of a so-called “flow chart.’’ The second stage is the actual coding. 
lt is often best to write a code in terms of a symbolic language first, for 
then changes are easily made. Numbers are assigned to the symbols, 
and the final code is prepared. In the third stage some procedure is 
wed to get the code into the memory of the computer. The fourth 
alage consists in “debugging” the code, i.e., detecting and correcting any 

a] 
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errors. The fifth and final stage involves running the code on the 
computer and tabulating the results. In this chapter and the next, only 
the first two stages are considered. Techniques for the fourth stage 
are described in Chap. 5. 

Philosophy of Programming. As will become abundantly clear in the 
following sections, a single error in one instruction invalidates the entire 
code. Hence programming is an exacting technique, requiring attention 
to details without losing sight of the over-all plan. Learning to program 
must be developed from within a person. Following through each 
instruction of the examples given in this chapter is important, but it is 
not sufficient. The student must program problems by himself if he is to 
become aware of all the detailed thinking and reasoning associated with 
the programming technique. The method of teaching in this text is 
through example. The student should try to do the examples himself, 
using the text only as a guide to the methods involved. 


2-2. Number Systems: Conversion 


Number Systems and the Radix. Preliminary to any discussion of 
coding should come an analysis of various number systems. In Sec. 1-8 
we noted that 8,675 really means 


8 x 10?+ 6 X 10?+ 7 X 10' + 5 X 10° 


where 10° = 1. Since decimal numbers are expanded as illustrated in 
powers of 10, we say that the decimal number system has a radix of 10. 
This concept can be generalized to define number systems based on any 
positive-integer radix q: if a number N is based on radix q, then it can be 
expanded as 


N =a.gq" + Gago tess + ag? + ag + aq? 
for integer N > 1, or 
N =ang!+a.g? + a-3¢7% + + °° + Omg” 
for 0 < N <1, where ap, ai, . . . , G1, @-2,... are nonnegative inte- 


gers each less than g. Of course, for noninteger N > 1, the expansion 
contains both the positive and negative powers, 


N = ang? t+ +++ + aeg? + aig’ + aog? 
tang'+aug? +++ + a-ng™ 
¥or example, 
8,675.8675 = (8 X 10%) + (6 X 10%) + (7 X 10") + (5 X 10°) 
+ (8 X 10-') + (6 & 10-*) + (7 X 10-*) + (5 X 10°) 


that; is, a; = 8, a2 = 6, a) = 7, do @ 5, Ga 8, da = 6, ao 7, and 
a_,= 5. Even though we are most familiar with the system forq = ten, 
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history records ancient civilizations that used q = six, q = twelve, and 
eveng = sixty! For our purposes we are most interested ing = 2 (binary 
system), g = 8 (octal system), and g = 16 (sexadecimalt system). 

lirst note that for radix g the number system must involve g symbols. 
lor example, for g = ten, the ten symbols are 0, 1, 2, 3, 4, 5, 6, 7, 8, 
und 9; for g = 2, the 2 symbols are 0 and 1; for g = 16, the 16 symbols 
most often used are 0, 1, 2, 3, 4, 5, 6,7, 8,9, A, B,C, D, E,and F. Then, 
if N = ang” + ang’ + + + + + aoqg? + aig! + aoqg®, where the a; are 
of course chosen from the g symbols, the number is conventionally written 
hy juxtaposition of the coefficients a; as follows: N = Qndn_1 * * * G2Q1Q. 
I N=aiug'+::++: +a_nq-", then the number is usually written 
by juxtaposition of the a; preceded by [.]—that is, .a_ya_. ++ + Qom. 
lor noninteger N > 1 we would have a, -* + @o.@-; °° * Geom. With 
(his in mind, counting in the four different systems mentioned above is 
accomplished as in Table 2-1. 

Converston from One Number System to Another. Using number sys- 
toms based on different radices is like talking in different languages. 
Just as one can translate one language into another, so also one can 
“convert” from one radix to another. For example, in Table 2-1 num- 
hers on the same row are equivalent—that is, 


28 (decimal) = 11100 (binary) = 34 (octal) = 1C (sexadecimal) 


‘The binary, octal, and sexadecimal equivalents of the decimal numbers 
from 0 to 15 should be memorized. 

In order to derive a systematic method for converting from one num- 
hor system to another, we make use of the well-known result that if 
N and q are positive integers then there always exist a unique (non- 
ogative) integer r less than g and a unique integer S such that 

Neas+2 O<4<¢ 

q qd 
let us apply this to convert an integral number with radix p, say N,, 
(0 its equivalent with radix q, say N,. In other words, we wish to deter- 
nine the nonnegative integers @, @1, . . . , Qn, each less than g, such that 


Np = nq" + ++ + + €2q? + aig’ + aog® 
Observe that 
“ = dngh! + + + + + ag! + aig? aed 
Integer Fraction 
=e Sy a. Yo 


q 
| Alno lows correctly called hexadecimal, 
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Tasie 2-1. Conversion TABLE 


aceasta na ean a an ene, 

















Decimal Binary Octal | Sexadecimal 
qq =10) | @=2) | @=8) (q = 16) 

0 0 0 0 

1 1 1 1 

2 10 2 2 

3 11 3 3 

4 100 4 4 

5 101 5 5 

6 110 6 6 

vé 111 7 7 

8 1000 10 8 

9 1001 11 9 
10 1010 12 A 
11 1011 13 B 
12 1100 14 Cc 
13 1101 15 D 
14 1110 16 E 
15 1111 17 F 
16 10000 20 10 
17 10001 21 11 
18 10010 22 12 
19 10011 23 13 
20 10100 24 14 
21 10101 25 15 
22 10110 26 16 
23 10111 27 17 
24 11000 30 18 
25 11001 31 19 
26 11010 32 1A 
27 11011 33 1B 
28 11100 34 1€ 
29 11101 35 1D 
30 11110 36 1E 
31 11111 37 1F 
32 100000 40 20 
33 100001 41 21 
34 100010 42 22 
35 100011 43 23 
36 100100 24 










[Cuap. 2 Suc. 2-2] PRINCIPLES OF PROGRAMMING 57 
Since the quotient and remainder are each unique, equating the fractional 
parts we have ao = ro and So = ang?! + +--+ + aegi + aig®. Divid- 
ing So by q and using our above results, we obtain 
So ay TL 
S=ag"+:°-: +ag+—=S8 = 
q 4 er ae ae: 


whence a; = 7, and so forth. 

Hence the systematic conversion method is as follows: Divide N, by q; 
the remainder is a. Divide the resulting quotient by q; the remainder is 
a,. Divide the resulting quotient by q; the remainder is a); ete. As an 


example, we have converted 28 (decimal) into binary (q = 2) in Table 
2-2. Hence, 


28 (decimal) = 11100 (binary) 


TaBLE 2-2, DecimaL TO BINARY CONVERSION (Np > 1) 


Remainders 
2128 
2|14 0 = a 
2/7 0O=a 
2|3 1 =a. 
2\1 1 =a; 
0 l=a 


‘The work can be more conveniently arranged from right to left, as follows: 


Divide by 2 


= 


J Place remainder 





0-1-8 —T7T—H< 282 


Ge lhe Taye abe 
1 1 1 0 0 Remainders 


Aw another example, let us convert 28 (decimal) to octal (q = 8), 


0O—38e— 28/8 
bd 
3.64 Remainders 


whence the converted octal number is 34. 
‘To convert 28 (decimal) to sexadecimal (a = 16), 
Oe— le 28)16 
A es alias 





———— 
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Hence.the converted sexadecimal number is 1C. [Here, of course, 
28 — 16 = 12 (decimal) = C (sexadecimal) from Table 2-1.) wae 
Next consider 0 <N,< 1. We wish to find nonnegative integer 


coefficients a1, G2, - . - » Gm, each less than q, such that 


N,= ang + aog?+:+* +a mgr” 
Here gNp=antargit-o° + a_mqo™t 
= uit v-1 


where u_; is the integral part and v_1 is the fractional part of gN p; hence 


U1s= 4-1 
and v4 = ag + eras + amg} 
Again qia=Getaag' to + amg ™t? = ue + 0-2 


whence wu_2 = @-2, ete. 


The conversion method thus is as follows: Multiply N, by q; the 
integral part is a... Multiply the resulting fractional part by 4; the 


integral part is a_2; ete. 


TaBLE 2-3. DecrmaL TO BINARY Conversion (0 < Np < 1) 


Integral parts 


0.28 

x 2 

a. =0 56 
x 2 

a. = 1 .12 
x 2 

a_3 =0 .24 
aa 

a4 =0 48 
x 2 

a_5 =0 -96 
x 2 

a6 = 1 .92 
x 2 

a_7 =I] .84 
x 2 

a_s = | .68 


As an example, we have converted 0.28 (decimal) to binary (q = 2) 
in Table 2-3. Hence 0.28 (decimal) = O1O00111 + = (binary). The 
work can be more conveniently arranged from left to right, as follows: 
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Multiply by 2 


Place 
integral | 
part 





2x 0.28 > .56 > .12 > .24 — 48 > .96 > 92 > .84 > .68 


v 1 1 u i { 
Integral parts 0 1 0 0 0 1 1 1 
Note that the process does not necessarily end, as it must for the integral 


iimbers, but may be carried out to any number of significant figures. 
Conversion to the octal and sexadecimal systems is similarly accomplished, 
lor nonintegral numbers greater than unity the integral part and the 
fractional part are done separately as just described. 

Note that in converting from binary to octal (or sexadecimal) we 
ivide, or multiply, the binary number by binary 1000 (or 10000), which 
js equivalent to shifting the fraction point to the left, or to the right, by 
ihvee (four) binary places. Hence for binary to octal conversion we 
slurt at the point and count off the binary bits in groups of three, to the 
loft and to the right. Then we write the octal equivalent for each 
of these groups of three bits. For example, if the binary number is 
11100.110100011, the octal equivalent is 


3 4.6 4 3 
011 100.110 100 O11 


Yo convert to sexadecimal, we count off similarly by groups of four. 
lor the same binary number, the sexadecimal number is 


1 C.D 1 
0001 1100.1101 0001 


(Meo Table 2-1 for binary-octal and binary-sexadecimal equivalents.) 
The converse will of course hold. 

Converting from Binary to Decimal. Another conversion method, par- 
\oularly useful for converting from binary to decimal, is based on the 
following factorization, illustrated for n = 4: 


N = aq + azq? + aoq? + arg! + aq? 
= (((aaq + as)q + a2)q + a1)q + a 


\{ V is a binary integer, then g = 2 and the a’s are either 0 or 1, but 
#, the most significant coefficient, must be 1. Hence, to convert NV 
fo decimal, start out with 1, and double it, obtaining 2; then add as, 
and double the result; add ag, and double the result; add a, and double 


7 


60 
the result; finally add ao. For example, 
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4 G3 G2 G1 40 


101 1 1=((24+0)2+12+1)2+1 
(4+1)2+1)2+1 

(0 + 1)24+1 

= 22+1= 23 


i 


ll 


The method is to accumulate the result while calculating, and the wor 


from left to right might look like this: 


Double 
Add 


0 1 = 23 


1 1 
VM 


Nee 


Similarly for N such that 
N = ang? + ang? + aang? + ag * 


we have the factorization 


1 1 
N = h(a + ; (as + (as + a+))) 


then q = 2 and the a’s are either 0 or 1. Th 
d by starting with the least. significa 
working toward the left as follow! 
and halve the result; ad 


If N is given in binary, 
decimal conversion is calculate 


unit (for example, a_, in our case), 
Add a_; to .5, and halve the result; add a_s, 


a_,, and halve the result. For example, 

1011 = 40+ 40+ KO + 0.5))) 
16(1 + 40 + 0.75)) 
14(1 + 0.375) = 0.6875 


is to accumulate while calculating, and the wo 


ll 


ll 


ll 


Of course the method 
from right to left might be arranged as follows: 


Halve] \ Add 
0 1 . 1 
okie Yodzs *045 0.5 


1 to decim 


| onversiOl 
This method can be used for octal and sexadecimal cor , 


simply by converting the octal or sexadecimal to binary first; ¢.g., 


1 . 
He: 53.53 (octal) = 101011.101011 (binary) 
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2-3. Number Systems: Arithmetic 


Addition and Multiplication Tables. We have already considered the 
urithmetic operation of counting in different number systems. Although 
wll arithmetic can be accomplished by counting, we are more familiar 
with the laborsaving operations of addition, multiplication, subtraction, 
wnd division. Of course these operations may also be carried out in 
other than the decimal number system. Addition and multiplication are 
twsentially carried out by means of tables. For decimal, binary, and 
octal the tables are shown in Table 2-4. 


TaBLe 2-4, ArirHmMeric TABLES 


DecimaL AppITION TABLE DecimaL MUuLTIPLICATION TABLE 














t}O 12 3 4 5 678 9 X/0 1 2 3 4 5 67 8 9 
0}0 1 2 3 4 5 6 7 8 9 0}|0 00 0 0 0 0 0 0 0 
1} 1 2 3 4 5 6 7 8 910 1/0 12 3 4 5 6 7 8 9 
)2 3 4 5 6 7-8 91011 2/0 2 4 6 8 10 12 14 16 18 
H)3 4 5 6 7 8 91011 12 3/0 3 6 9 12 15 18 21 24 27 
4/4 5 6 7 8 910 11 12 13 4/0 4 812 16 20 24 28 32 36 
6}5 6 7 8 9 10 11 12 13 14 5/0 5-10 15 20 25 30 35 40 45 
‘}6 7 8 9 10 11 12 13 14 15 6/0 6 12 18 24 30 36 42 48 54 
¥)7 8 9 10 11 12 13 1415 16 7/0 7 14 21 28 35 42 49 56 63 
§}8 9 10 11 12) 13 1415 1617 8/0 8 16 24 32 40 48 56 64 72 
#} 9 10 11 12 13 14 15 16 17 18 9/0 9 18 27 36 45 54 63 72 81 








Binary Muttipiication TABLE 





x 0 1 
0 0 0 
1 0 1 











xi 0 1 2 3 4 5 6 7 
01:0 0 0 0 0 0 0 0 
2}0 2 4 6 10 12 14 16 
3]0 3 6 IL 14 17 22 25 
4/0 4 10 14 20 24 30 34 
5|0 5 12 17 24 31 36 43 
6/0 6 14 22 30 36 44 52 
7/0 7 16 25 34 43 52 61 
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next instruction. Hence a typical four-address instruction format is as 


Arithmetic Operations. In both addition and subtraction the “carry” 
follows (where the four addresses are denoted by a, 8, y, and 6): 


or “borrow” is handled in other number systems exactly as it is handled 
in the familiar decimal system. Of course the addition tables are used 
during the calculation. For example, 












Address of 
next in- 










Operation | First operand | Second oper- | Put result in- 













1 1 0 1 (binary) 7 5 7 (octal) code address a | and addre 
Ss to add 
4, 1 0 1 (binary) + + 5 6 (octal) By to address ¥ | struction 8 
10 0 1 O (binary) 1 0 3. 5. (octal) i 

7 . ‘or any specific computer using a four-address instruction format, th 
1 : ; Cinch e 7 ; : ee umber of bits in each address and in the operation code must be ae 
S5.p 1 ( inary) sae 7 : (octal) in well as the actual operation code itself. In the design of a computer 
0 1 1 1 (binary) 6 6 5 (octal) (he number of bits reserved for an address presents an upper limit to the 


umber of words in the addressable memory of the computer. If an 
nddress is denoted by n bits, no more than 2” words can be contained in 
the directly addressable memory. 

Octal Shorthand. In order to illustrate the programming techniques 


Multiplication and division in other systems also follow the rules 
familiar to us in the decimal system, using the appropriate multiplicatio 
and addition or subtraction tables during the calculations. For example 





1011 (binary) 775 ~~ (octal) iloveloped in this section, we assume that there are 9 bits in each address 

xX 101 (bina ry) x 56 (octal) (2 = 512 addressable memory words) and 6 bits in the struction scale, 
ATE 5756 i total of (4 X 9) + 6 = 42 bits in a word. This brings up the first 
0000 4761 Important detail of coding: the actual bits in an instruction are not 
s0ti 55586 (octal) Written out; rather, some shorthand is written instead. For example, in 





110111 (binary) Hur case the bits would be divided into 14 groups of 3 bits each, and the 


1011 (binary) 
101|110111 (binary) 
101 
111 
101 
“101 
101 


2-4. Sequences of Instructions 


We shall initiate our detailed discussion of programming using the fou 
address instruction system. That is, in this section we shall consid 
instructions which explicitly include four addresses, since a four-addre 
instruction system includes all the essential ideas or basic ingredients 


an instruction. 


Instruction Format. As we have previously seen, some bits of thi 
instruction are set aside for the operation code designation—i.e., the 
tell the computer the instruction is add, multiply, divide, ete. The re 
of the bits usually denote the four addresses. For the more usual oper 
tions that involve two operands, such as addition, multiplication, ete 
two of the addresses are the addresses of the operands, The third addre 
tells where the result is to be put; the fourth address, where to obtain th 


velul equivalent of each group would be written out. In other words, 
{wo octal numbers would represent the instruction, and each address 
would be represented by three octal numbers. Thus, if 101 011 is the 


775 (octal) 
56[55566 (octal) 


502 
“538 hinury code for add, then the instruction that says, ‘‘Add the contents 
502 of address 011 010 110 to the contents of address 011 100 101, put the 
“346 fowult, into address 011 110 100, and take the next instruction from 
346 100 000 001,” is written in octal notation as 


Operation a B y 5 










53 





Ii) such cases it evidently facilitates matters to call the addresses in the 
memory by their octal numbers. It will be assumed throughout the rest 
of this section that such is the case. Also, numerical quantities will be 
written on the code sheet in octal (i.e., they will have to be converted 
from decimal to octal before being written on the code sheet). 

The Computer Manual. Yor our computer we must have a “computer 
iiunual’ that gives the operation codes of the different instructions and 
‘ilmwo precisely defines the meaning of the addresses for each instruction 
type. Tor example, see Table 2-5. As can be clearly seen, the coding 
Wiinual must always be at the coder’s side, Two further observations 
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must be reemphasized before we can proceed with an example. First, 
when a word is called into the arithmetic unit from the memory, it 1s not 
erased from its memory address, but remains there also. Second, when a 
word is put into a memory address, it replaces the previous contents of this 
address, i.e., it erases what had been there. 


Tasie 2-5. OpERATION CopES 
8 


Code | Operation Meaning 





Add contents of address a to contents of address 8; put sum in 
address y; take next instruction from address 6 

Multiply contents of address a by contents of address 8; pul 
product into address 7; take next instruction from address 6 

Subtract contents of address 6 from contents of address a; pu 
difference into address y; take next instruction from address 6 

Divide contents of address a by contents of address 6; put quotien 
into address y; take next instruction from address 6 


53 | Add 
42 | Multiply 
54 | Subtract 


41 | Divide 








Example. As we have noted’ above, a sequence of instructions i 
required for a problem. We shall illustrate such a sequence for a cod 


to evaluate 
Y= V oyt = l6gt? 


where Y is the altitude of a missile projected with an initial Y velocit 
component of Vo, computed at time #. Suppose that Voy = 1,000 fp 
g = 32 ft/sec’, and t = 53 sec. Now 1,000 (decimal) = 1750 (octal) 
32 (decimal) = 40 (octal), and 53 (decimal) = 65 (octal). Let us put th 
numbers 
1750 into address 10 | 
40 into address 11 4 
65 into address 12 


and 2 into address 13 


Let us reserve address 14 for the result of the product Voyt, address 1 
for t, address 16 for gt’, address 17 for Jagl’, and finally address 20 fo 
Vo,t — W4gt?. The first instruction would be ‘“‘Multiply the contents 
address 10 by the contents of address 12, and put the result into addre 
14,” where we have yet to indicate where the next instruction will 
found. This is written as 


Operation a B 


42 010 | O12 | O14 


Now let us put this instruction into address 1. Next we would wri 
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in instruction to form @ and put this instruction into address 2. Now 


we can finish the first instruction, 
42 010 012 014 002 


The third instruction would multiply # by g; the fourth instruction 
would divide gt? by 2; and the last instruction would subtract gt? 
from Voyé. A code is written on a coding sheet, which usually has the 
left-hand column labeled with addresses and the remaining columns 
lubeled with the parts of an instruction. In this way a clear account is 
made of exactly which instructions are located in which addresses. On 
4 coding sheet our code would look like Table 2-6. The student should 
eurcfully check each instruction in this code to be sure that each does what 
the “remark’’ says it does. There are several things missing from this 
tode: How do we get the computer started, and how do we stop it when 
the job is finished? 


TaBLE 2-6. Cope To CompuTE Y = Vo,t — 14gt? 























Instruction 
Address Opava: Remarks 
tion re B 7 : 

000 

001 42 010 012 014 002 Form Voyt 

002 42 012 012 015 003 | Form @2 

003 42 O15 011 O16 004 =| Form gi? 

004 41 016 013 O17 005 Form Mgt? 
005 54 014 017 020 006 Form Y 

006 See text 

007 

O10 00 000 000 001 750 Voy constant 
O11 00 000 000 000 040 g constant 

012 00 000 000 | 000 065 é constant 

013 00 000 | . 000 000 002 2 constant 

014 Vo,t temporary 
O15 t? temporary 
016 gt? temporary 
O17 14g? temporary 
020 Y = Vo,t — légt? 
































‘here are many systems in use for initiating a code on a computer. 


Vor example, after the code is read into the computer, the computer 
Wight automatically take the very first instruction from address 000. 
1) our example we would then have a dummy instruction, say “Subtract 


fontents of 007 from contents of 007, put result in 007, and take next 
thalruction from 001,” 


We used 007 because our particular code does 
























66 MACHINE LANGUAGES (Cuap. 2 


not involve 007. In order to stop the computer when the computation 
has been completed, the last instruction should be one that says simpl 
“Stop the computer.” For instance, suppose that 60 is the operation 
code for such an instruction; then we could put 


60 000 000 000 000 


in address 006. In such an instruction the addresses a, B, y, and 6 have 
” The ‘ee of Temporaries. Memory space in a computer is often at ¢ 
premium and should be conserved. From this point of view the abovi 
code is not satisfactory. For note that, after t2 has been computed 
address 015 is not used again. In other words, we could Have one f 
gt? and put this result back into 015, and then computed Légt? and put 
this back into 015 also, eliminating the necessity for using 016 and Ol 
Similarly, after computing Y, we could put this result into 015, eliminat 
i se of 020 also. 

ere sean up an important point about addresses reserved for ny ‘ 
bers. There are two kinds of numbers contained in addresses: per 
manent,” or constant, numbers, such as 2 and g, and temporary, 
or transient, numbers, i.e., numbers that appear only in ake f 
stages of the computation. These temporary numbers can share t e 
address with other temporary numbers that appear at some other tim 
Hence we call such memory locations temporartes. In our examp! 
above, 015 would be such a temporary. The adjusted code becomes as 
Table 2-7. 


TaBLE 2-7. Use or TEMPORARIES 






Instruction 





Address Remarks 
re 


Opera- 





























tion 1 B % 
000 54 007 007 007 001 | Take first instruction from O01 
001 42 010 012 014 002 | Form Vo,t 
002 42 | o12 | 012 | 015 | 003 |Form# 
003 42 015 O1L 015 004 
004° 4l 015 013 015 005 
o0s- | 54 | O14 | 015 | 015 | 006 
006 60 | 000 000 000 000 
007 j 
010 00 000 000 001 750 | Voy constant 
O11 00 000 000 000 040 |g constant 
012 00 000 000 000 065 | ¢ constant 
013 00 000 000 000 002 | 2 constant 
» 014 | Voyt temporary ‘ 
O15 Temporary, where final result 








found 
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2-5. Decisions 


The Comparison Instruction. We shall next give an example to be 
woded where a simple decision is made by the computer. Such decisions 
ure usually made by comparison instructions. For example, a comparison 
instruction might operate thus: ‘Compare the contents of a with the 
wontents of 6; if the contents of address @ is greater than the contents 
of address 8, then take the next instruction from address y; otherwise, 
luke the next instruction from address 6.” A few simple conventions 
tun greatly shorten such a statement: Let («) stand for “the contents of 
address &”; and let the word take mean “take the next instruction from.” 
Then the above statement becomes “If (a) > (8), take 7; if (a) < (8), 
luke 6.” Let the operation code for this instruction be 43. 

Kxample. Now consider the code for the following problem: Three 
iumbers, U, V, and W, are found as the contents of addresses 015, 016, 
und 017, respectively. We wish to write a code that will put the largest 
of these numbers into address 020. First note that in any case we shall 
wed three instructions that transfer the contents of addresses 015, 016, 
und 017, respectively, into address 020, and the rest of the code is to 
(locide which of these three instructions to execute. The transfer prob- 
lom is easily taken care of by adding the contents of the desired address 
0 the constant zero and putting the result into 020. (Hence we need an 
wildress whose contents is zero.) Let us put the three such instructions 
jnlo 005, 006, and 007 (see the code below). To determine which of 
these to execute, let us first compare (015) 2 (016). If this is the case, 

then we would compare (015) 2 (017). If this is also the case, then 
(O15) > both (016) and (017), that is, U > V and U > W, and hence 
we would execute (005), i.e., transfer (015) to 020. If 015 < (017), 
then (017) > (015) > (016), that is, W > U > V; hence we transfer 
(O17) to 020, i.e., execute (007). If (015) < (016), then compare 
(016) 2 (17). If this is the case, (016) > both (015) and (017), that is, 
\ =» U and V > W; hence we transfer (016) to 020, i.e., execute (006). 
If (016) < (017), then (017) > (016) > (015), that is, W >V > U; 

hence transfer (017) to 020, i.e., execute (007). The code is shown in 
Table 2-8, 

Several general observations can be made that are exemplified by this 
wade, Tirst, coding is a very exacting technique: every path of every 
jnmtruction must be followed carefully, for a single error invalidates the 
mitire code, Second, note that when using a comparison instruction 
(ho significance of the < must be carefully considered, as well as that of 
the >. Reading a code does not teach one how to code—actually writing 
i code does, Most students will undoubtedly feel uneasy about the 
above example until they rewrite it, doing the detailed thinking them- 
solvos, using the example only as an occasional guide and final check. 


re 
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The Sign Bit. Up to now we have been making two tacit ae 
tions, namely, (1) that all numbers are whole numbers (i.e., a“ "4 
fractional part) and (2) that all numbers and results are positive. F eh 
the assumptions are not necessarily true. They were made in order to 


TABLE 2-8. Copr To DETERMINE LARGEST OF THREE NUMBERS 










































Instruction 
Remarks 
Address Onan . : 7 ; 
tion 
021 021 001 | Take first instruction from 001 
000 54 021 | ae 
001 43 015 016 002 003 |U>V!. ; aN 
Yes, try U> W; ie., take 
| No, try V > W;ie., take 003, 
005 007 |U>W? . , 
sy ms a a Yes, U is largest number; i.e., ) 
| take 005 . 
No, W is largest number; i.e., 
take 007 
006 007 |V>W? ; 
me e ue ia Yes, V is largest number; i.e., 
take 006 ; 
| No, W is largest number; i.e., 
take 007 
ons 53 015 021 020 010 | Transfer U to largest number 
cell 
06 53 016 021 020 010 | Transfer V to largest number 
: cell 
07 53 O17 021 020 010 | Transfer W to largest number 
: cell 
010 60 000 | 000 | 000 | 000 | Stop computer 
O11 
012 | 
013 
014 | . 
ote | three given numbers 
W 
oe Largest number cell 
021 00 000 000 000 000 | 0 constant 
































aie ce ee ee ee ee 


concentrate on the coding problems so far considered and avoid ne 
sary complication in those discussions. We shall defer full pone at 4 
of the first of these problems for a later section, remarking here sale , 
a computer can consider numbers whose magnitudes lie mipbin a ae k 
range, which depends on the construction of the computer. 
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been illustrating a computer with a range of integers from 0 through 
2" — 1. Hence, before a problem is put on a computer, it must be 
“xcaled”’ so that all numbers that appear will be whole numbers. Simi- 
larly, problems must be scaled to fit on other computers, depending on 
(heir ranges. The second problem is solved as follows: A single bit is 
udded on the left to the length of the word, making the word 43 bits long. 
If this bit is a zero, then the machine is wired to interpret the number as 
heing positive; if this bit is a unit, then the machine will interpret the 
umber as being negative. The circuitry in the arithmetic unit must 
sense the signs of the operands and compute the proper values for this 
wo-called segn bit in the results. The meaning attached to this 43d bit, 
is far as an instruction is concerned, is different for different computers 
nd is discussed below; it is not a fundamentali ngredient of the instruc- 
lion. On the coding sheet, the sign bit is noted by affixing a + to the 
loft of the number if it is positive or zero, and a — if it is negative. 


%-6. Flow Charting and Symbolic Code Aids 


A flow chart is a diagram, or picture, of a code that is often helpful for 
Visualizing interrelationships between various parts of a code. Such a 
ingram is almost always made before the specific instructions are writ- 
lon. There are essentially three kinds of symbols used in a flow chart 
(woe Hig. 2-1). The first represents function calculation; the second repre- 
ments decisions and the various associated alternatives; and the third, 


{Ria} a “2 © 


Function calculation Decision calculation Connector 


wulled a (variable) connector, is simply a way to eliminate too many cross- 
ie lines in the picture or to indicate which lines to follow when one has to 


Fic, 2-1. Flow-chart symbols. 


Fic, 2-2, Simple flow chart. 


ontinue the diagram on another page. 


lor illustration, consider the example of Sec. 2-4. The functions V oyt 


and ‘og are to be calculated, and their difference is the final desired 
fowult, 


Henee we would draw the flow diagram in Fig. 2-2, The first 


eee 
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(left) box represents the instruction in address 001; the second box repre- 
sents the set of instructions in addresses 002 to 004; the third box repre- 
sents the instruction in address 005. The arrows indicate the order in 
which the computer does the indicated computing. Note that the sym- 
bols representing intermediate results appearing on the left-hand side of the 
equations, namely, « and y, appear later in the chart on the right-hand side 
of an equation. Only symbols that signify end results, like Y, will not 









TABLE 2-9, EXAMPLE oF SYMBOLIC CopE 


some name description or other symbolism to represent the addresses. 
Then, at a later time, specific addresses can be assigned for these symbols, 
or names, to produce the actual code. The intermediate code in terms 
of symbols is called the symbolic code. This technique is extremely use- 
ful, particularly in those cases where one must write instructions involv- 
ing addresses of constants or of other instructions that have not yet been 
specifically assigned. For example, this difficulty is encountered in writ- 
ing the code for the decision example of the previous section, since when 





























Instruction 
Address Seis 
tion & B Y 6 
U:V CMP U vV U:W V:w 
U:W CMP U WwW ULN WLN 
V:w CMP vV WwW VLN WLN 
ULN | ADD U $ LNC | S 
VLN ADD Vv o LNC S 
WLN ADD WwW o LNC s 
Ss 
U 
Ww Vv 

is largest WwW 

LNC 
Fic. 2-3. Flow chart including decision boxes. MeN Lh ee ee oe 












appear on the right-hand side of an equation. This observation eve 
superficial way to check that allintermediate results are used appropriate 
A more complicated flow diagram is associated with the problems 
Sec, 2-5. Here decision boxes are used (see Fig. 2-3). The lines leadi 
away from the ovals represent the different paths the computer will tak 
depending on the outcomes of the decisions. The little symbols (> a 
<) near the lines represent the conditions under which the program W 
go in the corresponding directions. The proper corresponding condition 
indicated when the symbol replaces the colon; i.e., the computer will follo 
the left-hand path when U > V and the right-hand path when U< 
The use of the connector is clear. The top decision box represents (001 
on the next lower level the left-hand box represents (002), the right-ha 
box (003). On the next lower level the left-hand function box represen 
(005), the right-hand box (006); and the lowest box represents (007). 
Symbolic coding is another intermediate aid between the stateme 
of the problem and the final code, Symbolic coding consists in writi 
a code not in terms of specific numerical addresses, but rather in terms 


that is, (0) = U. Vis the address such that (V) = V. 


ienignment of addresses then is shown in Table 2-10. 


Writing the instruction for U:V, the coder must refer to the not as yet 
written or located instructions for U:W and V:W. A symbolic code 
for this example might be as shown in Table 2-9. The symbolism of this 
wode is as follows: U is the address whose contents is the number U— 
W is the address 
wich that (W) = W. U:V is the address whose contents has the compare 
inatruction concerning U and V, and similarly for U: W and V:W; ULN is 
the address of the instruction that puts the number U into the largest 
iimber cell, and similarly for VLN and WLN. LNC is the address of the 
largest number cell; @ is the address whose contents is the constant zero; 
wud § is the address of the stop instruction. CMP stands for the com- 
pare operation code, and ADD stands for the add instruction code. 


Sleps in Writing a Code. The procedure for programming or coding 
jn ue follows: (1) Understand the statement of the problem. 
i flow diagram for the problem. (8) Code symbolically each box of the 





—————————— 
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flow diagram. (4) Assign addresses. (5) Write out the final detailed 
code. The flow diagram presents an over-all visual picture of the prob- 
lem, organizing the coding effort so that it can be concentrated on one box 


at a time. Symbolic coding enables an instruction to be written before 


Tasie 2-10. ASSIGNMENT OF ADDRESSES 


Symbolic name Address (or code) 


U:V 001 
U:W 002 
Viw 003 
ULN 005 
VLN 006 
WLN 007 
Ss 010 
U 015 
Vv 016 
WwW 017 
LNC 020 
> 021 
CMP 43 

ADD 53 










the coder knows the addresses of constants and other instructions refer 
enced by this instruction. In addition, since the symbolic code name 
for these addresses are suggestive of their contents, the symbolic code i 
easier to write and follow. 


2-7. Recursion Codes and Instruction Modification 


Loops in the Flow Diagram. In Sec. 2-4 we considered the problem 0 
determining the altitude at a time ¢ of a missile projected with an initi 
y component of velocity Vo,. A more realistic problem would be 
determine the trajectory of the missile, i.e., the position of the missil 
say, after each 10 sec of its flight. For such a problem we use a loop, 
recursion code, i.e., a code which uses a certain set of instructions ov: 
again several times. Now at any time ¢;, the x and y components 
position will be 

xi = Vodli 
yi = Vows — Page? 


where Voz is the initial « component of the velocity. To be concret 
suppose that Vor = 2,000 fps, Vo, = 1,000 fps, and g = 32 ft/see 
Then at time t: (= 10 sec) we would have 21 = 20,000 ft and 


yx = 10,000 — 1,600 = 8,400 ft 
At time tz (= 20 sec), v2 = 2,000 X 20 = 40,000 ft, and 
yo = 1,000 X 20 — 16 * 20? = 18,600 ft 
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At time ts (= 30 sec), v3 = 2,000 X 30 = 60,000 ft, and 
ys = 1,000 X 30 — 16 X 30? = 15,600 ft 


eae forth. During such a computation it is clear that the same for- 
a as are used over again, each time increasing our t; by 10 sec. How 
ever, the computation should stop when the missile hits the growid ice 

, Le. 


TaBLeE 2-11. CompuraTION or TRAJECTORY OF MISSILE 














7 hy xy Yi 
1 10 20, 000 8,400 
2 20 40,000 13,600 
3 30 60, 000 15,600 
4 40 80, 000 14, 400 
5 50 100, 000 10, 000 
6 60 120, 000 2,400 
7 70 140, 000 —8, 400 
ee a. 
When y; is no longer positive. In our case it is easily seen that this is 


when ¢ = 70 and y; = 70,000 — 16 
| j _ xX 70? = —8,400 ft. Table 2-11 rep- 
fowents the final results, and Fig. 2-4 is the flow chart of the process. a 








Read in the initial conditions 
Vo, = 2,000 
Vo, = 1,000 

t;= 10 
1 





i= 










Compute x;= V, t; 
v= Yy t-3 at 
Store X59; it; 








Let ¢;,., =¢;+10 
l.e., increase ¢ by 
10sec 







Print out 
Stop the computer 
tui NH 


Via, 2-4, Flow chart for computation of missile trajectory. 
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the flow chart we have introduced the 7 notation, where #;,1 represent 
the next time around and i + 1— 7% means that for the next iteration w 
replace the old ith values with the new (i + 1)st values. 

With this illustration it is seen that there are four basic ingredient 
to a recursion code. It must contain (1) a set of instructions, called th 
iteration instructions, that are to be reused; (2) another set of instruction 
that modifies the original set each time around; (3) a set of instruction 








Proceed with 4 
iteration i 


Tally: should the 
iteration continue, 
or is the process 
over? 














Modify the iteration \ 
for round i+1 







Continue 
iteration 


Exit 


Reset the loop 


A 


F1q. 2-5. Flow chart of generalized loop. 


often called a tally, that determines when to exit, or break out of the lo 
and appropriately notifies the computer; and (4) a set of instructio 
that sets up the initial conditions and starts the loop. In addition 
recursion code often contains (5) a set of instructions that resets the | 
so that it may be used again by the computer at some future time. 
generalized loop can be indicated by the flow diagram of Fig. 2-5. 
Sometimes the tally consists of instructions determining whether 
not the result of each iteration is smaller than some given number, 
occurs often in function computations; or the tally may just count t 
number of iterations until the desired number have been accomplish 
As an example of the former tally, consider the computation of /N 
within an error of ¢ by the following process (see Fig. 2-6): First m 
that ~/N is the x (and the y) coordinate where the curves % = y 
N = zy intersect. We approach this point by suecessive approximati 
starting from (ao,yo), Where vo = N, yo = 1, moving along a perpendi 
lar to a = y, dropping an altitude to point (ay) on N = xy, movi 
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from this point along a perpendicular to x = y, dropping an altitude to 
point (v2,y2) on N = ay, and so forth. The next point (7i41,yi41) is found 


y axis 


(x2, Y2) 


(1,91) (Xo, Yo) 





N x axis 


Fig. 2-6. Computation of VN. 





Fia. 2-7. Flow chart for computation of VN. 


ji terms of the present point (2;,y;) as follows: 


vit Yi N 
2 Yi41 = - 


Li+t — 








(woo Fig, 2-6), 


bined When a — yi <«, the desired accuracy has been ob- 
ied, 


The flow diagram for this loop is shown in Fig. 2-7. A symbolic 
vode for this appears in Table 2-12. 
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As an example of the latter tally, consider a code that computes x" 
where x and n are known. Here, after each multiplication, 1 is added 
to the tally temporary, and comparison is made with the constant n. a 
symbolic code for this is given in Table 2-13. Note that initially the 


TABLE 2-12. THe SQuaRE-ROOT ROUTINE 








Address | Operation a B OY 65 Remarks 
000 oN 
001 Divide N x y 002 Yiu = a, 
002 | Subtract x y A 003 Tent = Yiq1 
003. =| Compare A € 005 004 =| Ave Vi 
004 =| Stop ain 2 = Dey, 
005 | Add x y x foes _ ti : Yi 
006 | Divide x 2 x 001 
007 
010 : 
O11 
012 y 
013 : 
014 
015 : 
016 
| nn 
























































TaBLE 2-13. 2” ROUTINE 
en 


















































F Ik 
Address | Operation | « B ¥ 6 Remarks 
000 es 
i in a” temporary 002| Form partia 
001 Multiply | x | 2" temporary sclee 
j lly 
Tally temporary | Tally temporary |003] Adjust ta 
002 | Add 1 ally temporary oe 
003 | Compare | n | Tally temporary | 001 004 
305 ree a” temporary 
ie Tally temporary 
007 : 
010 : 
O11 
I 


contents of the x" temporary must be 1, and the contents of the (all 

temporary must be 0 (why?). ice 
The importance of the recursion code cannot be overempharize. a 

fact, as we shall see in the next chapter, special instructions are incluce 
rt, ) 

in many computers to aid the coding of such codes, 
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Instruction Modification. One of the fundamental concepts upon 
which modern computers are based is that of instruction modification, 
i.c., having a program automatically modify itself. Let us illustrate how 
instructions can modify instructions by means of the following examples: 

Suppose that, during the course of computing for a problem, the com- 
puter often needed to know the value of the binomial distribution function 


plc) = ee (0.1)7 (0.9)!°°-*, where fag) is the binomial coefficient, for 


w= 1, 2, 3, 4, or 5. One method for performing such a calculation is 
simply to write a table into the memory of the computer and “look up” 
(he value each time it is needed. 

Suppose such a table was written into addresses 001 to 005; ie., in 
uddress 001 is located 10‘p(1), in address 002 is located 10‘p(2), . . . , in 
105 is located 104p(5). Note that we must scale the values of p(x), since 
p(r) is less than 1 (for all values of x) but our computer interprets all 
iiimbers as greater than 1. Of course the numbers in addresses 001 to 
005 will be written in octal. 

Suppose also that the value of x (which for this problem can only be 
|, 2, 3, 4, or 5) is found in the 8 position of address 006 and that the 
result, namely, 10‘p(x), corresponding to this value, is to be put into 007. 
‘To perform this task, the code must have an instruction that will transfer 
(he contents of one of the addresses 001, 002, 003, 004, or 005 into 007; 
such an instruction (where the contents of 010 is the constant zero) is 






Operation | a B Remarks 





Add 010 ? base 0 to ? and put in 007 


I{ the contents of 006 were 1, then 001 should replace the ?; if (006) = 2, 
(then 002 should replace the ?; ete. The problem therefore resolves 
\inelf into modifying the instruction with the ? so that ? is replaced 
hy the contents of 006. One way of doing this is to write the instruction 
lirnt as 53 010 000 007 013, then to add (006) to it, the results being 
h4 010 001 007 013, and so forth. (Note that we have anticipated 
Placing this instruction in 012 and hence its 6 is 013.) But after the 
vole has been used, an additional instruction is needed to reset it. The 
flow diagram is as in Fig. 2-8. The entire code, then, is as in Table 2-14. 


Start Set up the read- Read the Reset the read- 
( table instruction table table instruction 


ig, 268, Flow chart of table looking-up routine, indicating reset instructions, 


a 
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TaBLE 2-14, BINOMIAL-COEFFICIENT Look-up RovutinEt 














Address | Operation | @ B y 5 Remarks 
000 54 010 | 010 | 010 | 011 | Take first instruction from 011 
O01 00 000 | 000 | 000 | 003 | 104p (1 ) 
002 00 000 | 000 | 000 | 020 | 104p (2) 
003 00 000 | 000 | 000 | 073 | 10‘p(3) 
004 00 000 | 000 | 000 | 237 | 104p (4) 
005 00 000 | 000 | 000 | 523 | 104p(5) 
006 00 000 | 004| 000 | 000} x = 4 (or 1, 2, 3, 5) 
007 Result = 104p(x) 
010 00 000 | 000 | 000 | 000 | 0 constant 
O11 1 53 006 | 012 | 012 | 012} Set up ? instruction 
012 53 010| ? |007| 013) This is the J instruction 
013 53 010 | 015 | 012 | 014| Reset the ? instruction 
014 60 000 | 000 | 000 | 000} Stop 
015 53 010 | 000 | 007 | 013 



























































+ See operations defined in Table 2-5, p. 64. 


Instruction M odification in Loops. Frequently instruction modification 
is necessary for the coding of loops. For example, suppose that it is 
desired to form the sum of a long column of numbers which are located in 
consecutive addresses of the computer memory from address 030 to 077. 















Modify the add 
instruction for 
the next number 
















Initiate the process: 
put the first number in 
the partial sum cell 









Add the next 
number to the 
partial sum 





077: 
address of number 
just added 





Reset add 
instruction and 
stop computer 


Fra, 2-9, Flow chart for instruction modification, 
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We could use the same add instruction for the successive additions pro- 
vided that the add instruction were modified before each addition so as 
to add the contents of the nexé successive address to the partial sum 
cach time round. The flow diagram is as in lig. 2-9 and the code as in 
‘Table 2-15. The arrows in Fig. 2-9 indicate the path of the computer 
(hrough the instructions. The sum is formed in address 001. The stu- 


dent should be sure to understand the purpose of each of the instructions 
of Table 2-15. 


TABLE 2-15. COLUMN-ADDING ROUTINE 


a 





Address || Operation | a@ B y 6 Remarks 

000 54 010 | 010 | 010 | 002 | Start at 002 

001 Final sum (temporary partial sum) 

002 53 010 | 030 | 001 | 003 | Put first number into partial sum 
temporary 

003 53 O11 | 004 | 004 | 004 | Increase 6 of add instruction by 1 

004 53 001 | 030 | 001 | 005 | Add instruction: add next number 
to partial sum 

005 43 012 | 004 | 003 | 006 | Tally: is 077 > 6 of 004? 


Yes: add another number 
No: sum is complete 


006 53 010 | 013 | 004 | 007 | Reset add instruction 
007 60 000 | 000 | 000 | 000 | Stop 

O10 00 000 | 000 | 000 | 000 | 0 constant 

Ol 00 000 | 001 | 000 | 000 | 1 in B constant 

O12 53 001 | 077 | 001 | 005 | Constant with B = 077 
O13 53 001 | 030 | 001 | 005 | Constant with 6 = 030 


030 
Numbers to be added 








077 


























Loops within Loops. As an example of multiple loops, or loops within 


loops, consider the computation of a table of sin 2 by means of the 
infinite series 


x xe gi xv gu 


— 31751 717 91 im 


sinz =2 





for each 490 radian from 0 to 7/2, to eight decimal places. First we 
would have a loop that formed x"/n! by multiplying a partial product 
successively by a/P; (loop A); then we would need a loop that adds or 
subtracts this result to or from the partial sum and increases n by 2, 
until the partial sum becomes correct to eight significant figures (loop B); 
wid finally we need a loop that increases a by 0.01 and continues to 








a 
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compute the next value of sin z (loop C). The flow diagram for this 


process appears in Fig. 2-10. 





Set n= 1 and the contents 
of "partial sum temporary" 
cell equal to 0 















Set P= 1 and contents 
of "partial product temporary" 
cell equal to 1 







Multiply partial 
product by = 





x,+0.01—> x, 


Add (or subtract) 
= to partial sum 
n: 









= : 0.000000005 





fl 





Print out the value of x, 
and the corresponding 
value of sin x, 






Stop the computer 





ithi i ; +1 Pin; +27 Ny 
- hart of loops within loops. The notation Ps + Ping + 4 
aaa ier ee a etc., i often used to indicate that in the (¢ + 1)st iteration 


Pigs = Pi +1, nyu = 07 + 2, and ae41 = t + 0.01, ete. 


2-8. The Subroutine 

Use of a Subroutine. A subroutine is a subcode that mney = a 
many times during the computation of a program but is writ en ong 
once in the whole code. As the computer proceeds down the rire fe 
gram, the control will occasionally jumpt to this subroutine an _ 
after doing the subroutine, will jump back to the main proares Ww - 
it left off. This detour from the main program through the subroutin 
f “take the next instruction from,” 








j is commonly used in the sense 0 : 
ie ee to ii wnt means “take the next kgs f ron 
address where the first instruction of the subroutine is located. : : ur a 
purposes a jump instruction usually refers to an instruction whose 6 addr 
the jump in question, These terms should be clear in context, 
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may occur several times during the computation of the program. Hence 
« subroutine must have an entrance, a way of getting into it, and an 
evit, a way of getting out of it. Each time an entrance is made to a 
subroutine, some initial conditions must be set up that are character- 
istic of the place in the main program from which the entrance was 
made. For instance, if the subroutine calculates some function, the 
initial values of the independent variables at that point in the main 
program must be given to the subroutine. In addition, as an entrance 
{o a subroutine is made, the exit must be set up; i.e., the subroutine 
must be told where to transfer control back to the main program. For 
ouch time the subroutine is used, its exit is usually to some different 
part of the main program. Hence, in order to use a subroutine, the coder 
inust know (1) the entrance, i.e., the address of the first instruction; 
(2) the addresses of the temporaries in which the initial conditions are 
lo be set up; (3) the addresses of the temporaries whose contents will 
he the results of the subroutine computation; and (4) the exit, i.e., the 
uddress of some jump instruction that is to be preset (when entering the 
subroutine) so that, when the computation of the subroutine has been 
wompleted, the computer will transfer control back to the proper address 
of the main program. 

Consider, for instance, the flow diagram in Fig. 2-11, which illustrates 
(he setting up of the initial conditions and different exits of the sub- 
routine. The A in the circle is a connector and of course means that the 
program jumps to the subroutine at that point. The B connector is 
ulled a variable connector; i.e., it indicates that the program jumps either 
0 B, or to Bs, depending on how the “‘variable” connector is set. When 
Wwe say that B = B,, we mean that the B connector is set so that it will 
go from B to B,; and when we say that B = Bz, then the B is set to go 
from B to By. The rest of the flow diagram is self-explanatory but 
should be studied carefully by the student. 

Nvample. As a specific example of the use of subroutines, consider a 
program which is to compute the radiation intensity S of an antenna at 
(i point in space as a function of the directional angles @ and ¢ and the 
radius R given by the equationt 


7 . (nr. 
30 To? cos ( cos a) sin ( sin 6 cos e) 
aa Qn R? sin 6 an 
sim {5 sin 6 cos ¢ 


2 





| 


where n = a given integer 
/) = a known constant 


| Soe, for example, J, A, Stratton, “Electromagnetic Theory,” p. 451, eq. (53), 
MoGraw-Hill Book Company, Ine,, New York, 1941, 


a 
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Now the sine and cosine functions are evaluated by means of their po 
series. Since these functions appear several times a 
ent arguments, it would be wise to use their respective power-series codes 
as subroutines. Figure 2-12 is a flow chart for the calculation of S and 


indicates the use of the subroutines. 


This flow diagram can be used to illustrate 
in using subroutines. For example, suppose 








Set up first exit of subroutine, 
ie., let B= Bi 


Set up second exit of subroutine, 
i.e., let B= Bo 





Main program continued 
' 


Fig, 2-11. Flow chart for setting up initial ¢ 
subroutine, 





Loe ee ee ee ee ee 


onditions and different exits of 


[Cuap. 2 


wer 
s functions of differ- 


some of the details involved 
that the first instruction of 
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Set w=cos ~ 
t=sin @ 


Form v= > tu 


Set g=cos 0 


Form r=54q 


Let s=cosr 


Let J=sin k 


Form final result 


Set x= 9 
B= B; 





(4) 


Fig, 2-12. Flow chart for computation of radiation intensity. 
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the cos x subroutine is located in address 700. Then, in the four-address 
system, the 6 address of the jump instruction to this subroutine would be 
700, which completes the first step in using a subroutine. The second 
step is to set up the exit from the subroutine. One method is to have in 
address 677 (the address preceding 700) a jump instruction for which the 
8 address is yet to be designated, ie., with the 6 address initially 000. 
Suppose that the main program will continue after the subroutine from 
address 017. Then we would want the 6 address of (677) to be 017; 
then at the end of the subroutine would appear a jump to 677, whence 
upon executing (677), the computer will jump to 017. To set up (677), 
the main program would add a constant with 6 = 017 and zeros else- 
where to (677). Another method of accomplishing this return jump to 
the main program is through some sort of special instruction designed for 
this purpose (see Chap. 3 for further discussion of this point). In this 
case the jump to the first instruction of the subroutine would auto- 
matically memorize its own address in some counter or index register. 
Then a special jump instruction is placed into address 677 that looks at 
this counter or register and jumps back to the main program. 

For the third step in using a subroutine the main routine must some 
how tell the subroutine the values of the initial data, i.e., the independen 
variables. For this purpose a common agreement is made between th 
person who writes the subroutine and the user of the subroutine abow 
certain addresses set aside for this purpose. For example, suppose tha 
address 676 were agreed upon to be the address of the initial value of 
in the cos x subroutine. Then, before jumping to the subroutine, th 
main program would transfer the initial value of x into 676 for the u 
of the subroutine. Similarly an agreement might be made that the fin 
computed value of cos x be placed in 675; when the computer exits fro 
the subroutine back to the main program, the main program first transfe 
(675) to some temporary that the main program uses. Of course, ther 
is an alternative method for performing these two tasks. The mai 
program could first tell the subroutine where the initial value of 2 i 
located and where the final computed value of cos x should be place 
and the subroutine could include the transfer instructions. In such 
case the instructions of the main program that “tell” the subrouti 
would do so by modifying some instructions of the subroutine. 

Library of Subroutines. As illustrated by the example, several su 
routines may be used in one main program. In fact it is found that man 
common subroutines are used quite often, such as sin 2, cos 2, ¢*, Vx 
etc. Hence it often becomes profitable to have a library of subroutine 
available to the programmer, stored at all times in some part of t 
computer memory, There would also be some catalogue kept outsi 
the computer that the programmer can consult when he wants to use 
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subroutine. This catalogue would tell where each subroutine is located 


and all pertinent data about how to use it, such as where to put the 
initial values of the independent variables, where the computed values 
of the dependent variables are found, etc. In addition, as we shall dis- 


: iss in Part 2, automatic-programming aids are devised to make such a 
ibrary of subroutines easier to use. 


2-9. Exercises 


Convert the following decimal numbers to binary, octal, and sexadecimal. Carry 


the co ersl far enou, 
£ t e conv 
a on no h na th erted num ber differs from the or iginal 


Answers 
Deci : 
ecimal Binary Octal Sexadecimal 

we 56 111000 70 38 
“fe - .79 . 110010100 .624 CA 
2-8 8.79 111000. 110010100 70.624 38.CA 

- j . 101110110 -566 
“4 87, 231 10101010010111111 252277 ‘cen 
-6. 87, 231.732 10101010010111111.101110110 252277 .566 154BF.BB 


2-7. Convert —56.79 to binary, octal, and sexadecimal. (Hinr: A negative num- 


her is converted as if it iti ive sign i 
man were positive, and then the negative sign is affixed to the result. 


Convert to decimal: 
2-8. 70 (octal). 

2-9. .624 (octal). 

2-10. BB (sexadecimal). 
2-11. 70.624 (octal). 


aie panes the results of Exercise 2-11 with those of Exercise 2-3, and explain 
hee B: ake an addition and multiplication table for sexadecimal numbers. [Hint: 
1 the ac dition table the rows (and columns) count by 1; the row (or column) of th 
iultiplication table counts by n if n is the number of the row (column).] ss 


lorform the following operations in binary: 


Answers 

9-14. 111 +1 
1000 

2-16. 1010 + 111 10001 
2-16. 1110 — 1 1101 
2-17. 1 — 1110 —1101 
2-18, 1101 X 1011 10001111 
2-19. 11011 + 110 100.1 
Vorform the following operations in octal: 
2-20, 6,754 + 777. 
4-21. 73 — 56. 


2-22. 555 — 62, 
2-23, 5,715 + 65, 
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Using the tables of Exercise 2-13, calculate in the sexadecimal system: 


Answers 
2-24. AFCB + 1AD8 CAA3 
2-25. 842D — 65AE 1E7F 
2-26. 3C X D1 30FC 
2-27. 1A8E + 21 CE 


2-28. Using the instructions defined in this chapter, write a code that calculates x = 
a3 — be, where a = 313 (decimal), b = 983 (decimal), and ¢ = 838 (decimal). 
2-29. In the example given in Sec. 2-4, why are two temporaries needed, that is, 
014 and 015? Write a code using only one temporary. [H1tT: Vit — 44g? = 
(Vo — Segt)t.l 
9-30. If an instruction format for a computer allowed 12 bits for each address, what 
is the maximum number of addressable words possible for the memory of that com- 
puter? [Ans. 21? = 4,096 (decimal) words.] If the first address of this memory is 
0000, what is the last memory address? [Ans. 7777 (octal) (that is, 21? — 1 written 
in octal).] 
9-31. Using the instructions defined in this chapter, write a code that calculates 
X =minimum(U,V,W), where U, V, and W are the contents of addresses 010, 011, 
and 012, respectively. 


Using the instructions defined in this chapter, draw the flow diagram, write a sym- 
bolic code, assign addresses, and finally write the final codes to calculate x [wher 
a = 313 (decimal), b = 983 (decimal), and c = 838 (decimal)|: 

2-32. x =a? — b-c. 

2-33. x = b(a + ¢). 

a} — b-c 

2-34. x = bao) : 

If U, V, and W are the contents of addresses 010, 011, and 012, respectively 
calculate: 

2-35. 2 = minimum(U,V,W). 

2-36. x = maximum(U?,V?,W?). 

2-37. x = maximum(U* — VW, V(U + W)). 

Hinr: The same symbolic code for Exercises 2-32 and 2-33 appears as parts of t 
symbolic code for Exercises 2-34 and 2-37. 


2-38. Code the flow diagram of Fig. 2-4. 

2-39. The symbolic codes of Tables 2-12 and 2-13 are incomplete in that they do n 
reset the computer for additional runs. Complete these codes accordingly and assi 
specific addresses. 


Tf z is found in 001 and n is in 002, draw flow diagrams and write codes that wi 
compute the following functions: 

2-40. 2" /n!. [Hinr: Form a partial product by multiplying by a/(n — 2).] 

9-41. hp =ltata?/2ht+e/Bltec:. (Hinr: Compute enough terms 
that 2"/n! < ¢; this test will tally the breaking of the loop.) 

2-42. The binomial coefficient n!/x!(n — a)!. 

2-43. The first moment of the binomial expansion, namely, 


n 


ua a n! aqnee 
rl(n = a)! Pr" 
wed 
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9-44. Write a routine that forms the sum of the squares of numbers found in 
adresses 030 to 077. 

9-45. Write a routine that puts the numbers found in 030 to 077 in numerical order 
Hine: Successively interchange locations of numbers in adjacent addresses if they 
aro not in order; keep track of the number of interchanges each time through; when no 
\nterchanges are necessary, then the numbers are in order; that is, for example: 


a * * 

1st time through: 4,2,3,1 2,4,3,1 2,3,4,1 2,3,1,4 
; las PA las 

2d time through: 2,3,1,4 2,3,1,4 2,1,3,4 2,1,3,4 
; 7A nN la 

3d time through: 2,1,3,4 1,2,3,4 1,2,3,4 1,2,3,4 
TaN nN a 


4th time through: 1,2,3,4 1,2,3,4 1,2,3,4 1, 2,3,4 
(no interchanges; so numbers are in proper order). 


Construct a library of subroutines as follows: 

4-46. Write a subroutine for +/z for your library. 

8-47. Write a subroutine for 2”/m! for your library. 

%-48. Write a subroutine for sin z for your library. (Hinv: This subroutine will 
\iwolf use the subroutine of Exercise 2-47.) 
' rg Write a subroutine for cos xz for your library. (Do this using only a single 
Hop. 

4-60. Write a manual describing how to use your library of subroutines. 

4-51. Code the example given on page 81, using your library of subroutines. 


CHAPTER 3 


INSTRUCTION FORMATS AND OPERATION TYPES 


3-1. Three- and Two-address Instruction Systems 


Using Fewer than Four Addresses. Up to now we have considered an 
instruction format containing four addresses. It was remarked in Sec. 
9-4 that each of the four addresses is a necessary ingredient to coding. 
However, four-address systems are not used in practice. Commercially 
available computers use only three-, two-, or one-address formats, the 
latter perhaps being the most common.t Computers using fewer than 
four addresses must compensate by (1) having the omitted information 
“wired in” and by (2) using more instructions to achieve the same ends 
The primary advantage of having fewer than four addresses in the 
instruction is that for the same word length each address will comprise 
more bits and hence give the computer a larger addressable memor 
capability. In the 42-bit format described in Sec. 2-4, 36 bits we 
reserved for the four addresses, making each address 9 bits long, for 4 
maximum of 512 words in the addressable memory. If three addresse 
were used, each would comprise 12 bits, allowing a 4,096-word memory 
two addresses, 18 bits per instruction, a 262,144-word memory; on 
address, 36 bits, 68,719,476,736 words. Of course shorter words call 
be designed for machines with one- or two-address instructions, and thi 
in itself can save a great deal of hardware. 

The Three-address Insstrution. The usual three-address instruction 
similar to the four-address instruction except that the 6 address is omitted 
from the format. Hence the format becomes 













Operation | Address of first 


Address of second | Address into which} 
code operand, a | 


operand, 8 result is put, 7 


How does the computer know in what address to find the next instrue 
tion? Except for special instructions, the computer will always take & 
the next instruction the contents of the next consecutive address followi 


+ See the Appendix for examples of typical three-, two-, and one-address systems ¢ 


commercially available computers, 
88 
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(that of the present instruction. That is, the computer will normall 

take the instructions consecutively in the order in which they are acer 
into the memory. This is accomplished by means of the current-address 
register. This register is first set at the address of the first instruction: 
then, as each instruction is executed, the contents of this register is riot 
mally increased automatically by 1, and this contents is taken by the 
‘computer as the address of the next instruction. (Because of this the 
vurrent-address register is often called the current-address counter.) With 


TaBLy 3-1, THREE-aADDRE:s Cope ror Y = Vo,t — Logt? 





Instruction 





Addre: 
ee Opera- Remarks 





0000 53 0007 0007 0007 | Take first i i 
rst instruct 
0001 42 0010 0012 0014 =| For ee nana 
: m Voyt 
0002 42 0012 0012 0015 | Form ?#? 
0003 42 0015 0011 0015 |} Form gi? 
0004 41 0015 0013 0015 | Form 14g? 
0005 | 54 0014 0015 0015 | Form Y 
es | 60 0000 0000 0000 | Stop computer 
0010 00 0000 0000 1750 | Voy 
OOL 1 | 00 0000 0000 0040 | g 
0012 00 0000 0000 0065 |¢ 


a | 00 0000 .0000 0002 | 2 constant 


0015 Voyt temporary 


Temporary, where final result is 
found 


a et 





























(his understanding of where the computer will find the next instruction 
the only change necessary in the add, multiply, divide, and subtract instrue- 
lions given in See. 2-4 (see Table 2-5, page 64) is the deletion of the 6 
address, However, the comparison instruction will need redefining thus: 
I (a) > (8), take next instruction from y; otherwise take the next cote 
wocutive instruction. If (aw) > (8), then the current-address counter will 
wilomatically be set at address y; if (a) < (8), the counter will behave 
in usual, just increasing by 1. 

Nvamples. For instance, the examples coded in Secs. 2-4 and 2-5 
would have to be rewritten as shown in Tables 3-1 and 3-2. 

"here is not much difference between Table 3-1 and its four-address 
version, Consider, however, Table 3-2. Each branch of this code 


should be carefully reconstructed by the student so that he may grasp 
jiwt what problems arise from ordering the instructions and how these 








Te 
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problems are solved. Note that there appear in addresses 0004 and 0001 
dummy comparisons, which artificially force a jump to address 0012, 
where the stop instruction is located. Also, a dummy comparison was 
made in address 0006 in order to reuse (0003). A total of 11 instruc- 


tions were necessary in the three-address system, while 8 sufficed in the 


+ 
TABLE 3-2. THREE-ADDRESS Cove To DETERMINE THE LARGEST OF THREE NUMBERS 


j 























Instruction 
Address Opera- Remarks 
tion i B 7 | 
0000 54 0021 0021 0021 | Take first instruction from 0001 
0001 43 0015 0016 0005 | U:V 
If U > V, take 0005 





If U < V, take 0002 

0002 43 0016 0017 0007 | V:W 

If V > W, take 0007 

Tt V < W, take 0003 

0003 53 0017 0021 0020 | Transfer W to largest number cell 
0004 43 0022 0021 0012 | Jump to stop 


0005 43 0015 0017 0011 |U:W 
If U > W, take 0011 


If U < W, take 0006 


0006 43 0022 0021 0003 | Jump to transfer W instruction 
0007 53 0016 0021 0020 | Transfer V to largest number cell 


0010 43 0022 0021 0012 | Jump to stop 
oo11 53 0015 0021 0020 | Transfer U to largest number cell 


0012 60 0000 0000 0000 | Stop 














0013 

0014 

0015 U 

0016 | V }given numbers 
0017 Ww 

0020 Largest number cell 
0021 00 0000 0000 0000 | 0 constant 

0022 00 0000 0000 0001 | 1 constant 












































four-address system. The placement of the instructions was somewh 
complicated; this difficulty is relieved by symbolic coding. 

The Two-address Instruction. In the two-address instruction usua 
both the y and 6 addresses are eliminated. The format is simply 


Operation | Operand Operand 


code address a | address 8 





The computer finds the location of the next instruction as in the th 
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address system, taking care of the function of 6. Where does the result 
of the operation go? This problem is solved in a two-address system by 
having the result of every arithmetic operation automatically left in a 
very special memory location, called the accwmulator. This special 


TABLE 3-3. Two-appreEss INSTRUCTIONS 





Code Operation Meaning 


53 | Add : Add (a) to (8), and put result in accumulator 

42 | Multiply Multiply («) by (8), and put result in accumulator 

54 Subtract Subtract (8) from (a), and put result in accumulator 

41 | Divide Divide (a) by (8), and put result in accumulator 

43 | Compare If (a) > (acc), take 8; otherwise go to next consecutive 
instruction (as usual) 

52 | Transfer Transfer (acc) into a, and take B 

51 | Add and transfer Add (a) to (acc), and put result into B 

40 | Multiply and transfer | Multiply (a) by (acc), and put result into B 





TaB_e 3-4. Two-appREss VERSION OF CopE oF TABLE 3-1 








Instruction 














Address Goats: Remarks 
tion 
000 54 017 017. ‘| Take first instruction from 001 
001 42 010 012 =| Form Vo,é in accumulator 
002 52 014 003 (|| Put Voyt into 014 
003 42 012 012 =| Form ¢? in accumulator 
004 42 011 777 ‘|| Form gi? in accumulator 
005 41 777 013 «| Form }4gt? in accumulator 
006 54 014 777. ~*| Form Y 
007 52 015 016 Put Y into 015 
010 00 001 750 Voy 
011 00 000 040 |g 
012 00 000 065 t 
013 00 000 002 =| 2 
014 Vo,t temporary 
015 Temporary, where result is found 
016 60 000 000 | Stop computer 
017 00 000 000 | 0 constant 








memory cell is located in the arithmetic unit; it has been discussed in 
Chap. 1 of this book. It suffices to note here that y is most often under- 
slood to be the accumulator, although as we shall see below there can be 
Muny variations to this theme. Hence the definitions of our instructions 
must be rewritten as shown in Table 3-3, 
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Of course, after the sum is in the accumulator, it must be transferred 
to an ordinary memory location. Hence we need transfer instructions. 
Examples. As mentioned above, the word length is often shortened 
for two- or one-address systems; so let us, for illustration purposes, use a 
word length of 24 bits plus one sign bit. In order to increase flexibility, 


TaBLE 3-5. Two-Aprress VERSION OF CopE oF TABLE 3-2 


a 











Instruction 
Address Remarks 
Opera- 8 
tion = 
000 54 021 021 | Take first instruction from 001 
001 53 016 021 | Put V into accumulator 
002 43 015 005 U:V 


It U > V, take 005 
If U < V, take 003 
003 43 017 010 | W:V (since V is already in accumulator) 
If W > V, take 010 
If W < V, take 004 


004 52 020 012 | Transfer V from accumulator into 020 
005 53 015 021 | Put U into accumulator 
006 43 017 010 W:U 


If W > U, take 010 
If W < U, take 007 












































007 52 020 012 | Transfer U from accumulator into 020 
010 53 017 021 | Transfer W into accumulator 

011 52 020 012 | Transfer W from accumulator into 020 
012 60 000 000 | Stop computer 

013 

014 

015 U 

“016 | given numbers 

017 Ww 

020 Largest number cell 

021 00 000 000 | 0 constant 


nn sn 


the accumulator is often given an address, say, for example, 777. The 
example codes as revised are shown as Tables 3-4 and 3-5. Note that 
two more instructions were needed in Table 3-4 than for the three- or 
four-address codings. The contents of the accumulator must be carefully 
watched, or errors might creep in. 

In Table 3-5 note how the comparisons were arranged so that the next 
instruction in sequence could be a transfer instruction direct, from the 
accumulator. This was successfully accomplished for V and U. How- 
ever, W had to be placed in the accumulator especially for its transfer 
to the largest number cell. In any event it is to be noted that the place= 
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ment of instructions and the order of the comparisons become more com- 
plicated in a two-address system. Note also that 11 instructions were 
needed, one more than was absolutely necessary for the three-address 
system (see Exercise 3-1) and three more than for the four-address system. 


3-2. One-address Instruction Systems 


The One-address Instruction. In the one-address instruction only one 
operand address, a, remains. The accumulator does a dual job, standing 
in for both the second operand address 8 and the address y into which 
the result is put. Address 6 is handled as in the three- and two-address 


TABLE 3-6. ONE-ADDRESS INSTRUCTIONS 


Code Operation Meaning 








53 | Add Add (a) to (acc); put result in accumulator 

42 | Multiply Multiply (a) by (ace); put result in accumulator 
54 | Subtract Subtract (a) from (ace); put result in accumulator 
41 | Divide Divide (ace) by (e); put result in accumulator 


43 | Conditional jump | If (acc) is negative, take a; otherwise take next instruction 
in sequence as usual 

52 | Transfer Transfer (acc) into a 

51 | Replace add Add (a) to (acc); put result into a 

40 | Replace multiply | Multiply (a) by (acc); put result into a 

44 | Jump Take next instruction from a 








aystems. There are a large number of variations on this theme, and 
often additional registers or special memory locations are used. How- 
ever, in presenting the principles of one-address coding it suffices here to 
consider only one register, the accumulator. In a one-address system 
iuny more kinds of instructions are found essential. For example, in a 
(ypical modern one-address computer there are over 200 instructions. 
This should be compared with the 50 and 40 instructions defined for 
typical modern two- and three-address computers. It is evident that, 
(he more complex and numerous the instructions that have to be learned 
hy the coder, the harder it is for him to code. In order to help the coder 
who deals with a machine with, say, 90 instructions, advanced program- 
ming techniques are resorted to; these will be discussed below. The 
necessity for advanced programming techniques to aid the coder prepare 
programs for a one-address computer system becomes even more striking 
when one reads the rather tortuous techniques needed to accomplish the 
simplest of complete operations, such as comparisons, as illustrated by 
ihe examples to be presented below. The list of one-address instructions 
in Table 3-6 is sufficient to meet our needs for the examples. 

The transition from two to one address for the add, multiply, subtract, 


a 
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and divide instructions is natural enough, but how is comparison per- 
formed? To perform a comparison, one of the numbers is put into the 
accumulator and the other subtracted from it; then a conditional jump 
is used. (A computer with more than one register may compare the 


contents of two registers.) 


TABLE 3-7. ONE-ADDRESS VERSION OF Cope or TaBLE 3-l 
me 








Instruction 
Remarks 
Address Opara: 
tion 

000 44 001 | Take first instruction from 001 
001 42 016 
002 53 021 Clear ‘accumulator, and form ¢? 
003 42 021 

004 42 020 | Form gt? 

005 41 022 | Form }4gi? 

006 52 023 | Put 14g? into 023 

007 42 016 

010 53 017 Clear accumulator, and form V oyt 
O11 42 021 

012 54 023 | Form Y = Voyt — 14g@? 

013 52 023 | Put Y into 023 

014 60 000 | Stop computer 

015 

016 00 000 | 0 constant 

017 OL 750 Voy 

020 00 040 |g 

021 00 065 |¢ 

022 00 002 | 2 

023 44gt? temporary and result 








Se es 


Let us shorten our word to 15 bits plus a sign bit, for 


Examples. 
‘ our examples can be 


convenience in this presentation; then the codes for 
itten as shown in Tables 3-7 and 3-8. 
oe for the first, that it is necessary to make sure that the contents 
of the accumulator was zero before using the add instruction to transfer 
a number into the accumulator. Also, we calculate Lo gt? ae 80 that 
after forming Vo,t in the accumulator we can subtract logt? directly 
from the accumulator, thus avoiding an extra clearing, etc., of the accu- 
mulator. More instructions are needed than for the other systems: one= 
address, 12; two-address, 7; three-address, 7; four-address, a One less 
instruction than used will suffice: the student should cheek this for him- 
self, revising the code to require one less instruction. 


i ions i ’ sompared 
There are 25 instructions in the code of the sec ond example as compe 
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with 11 for the corresponding code in the two-address system, 10 in the 
three-address system, and 8 in the four-address system. By choosing 
it more convenient set of one-address instructions (cf. Sec. 3-4) we might 
have reduced the length of the code somewhat, but not significantly. 


TABLE 3-8. ONE-ADDRESS VERSION OF CODE OF TABLE 3-2 

















Instruction 
Address Remarks 
Opera- 
tion . 

000 44 001 | Jump to first instruction of code, 001 

001 42 035 | Clear accumulator 

002 53 032 | Put V into accumulator set up comparison 

003 54 031 =| Form V — U in accumulator 

004 43 015 | If (acc) negative, U > V, try U:W; if positive, U < V, 
try V:W 

005 42 035 | Clear accumulator 

006 53 032 =| Put V into accumulator Ja up comparison 

007 54 033 | Form V — W in accumulator 

010 43, 025 | If (acc) negative, W > V, W largest; if positive, W < V, 
V largest 

Olt 42 035 | Clear accumulator 

012 53 032 | Put V into accumulator V is largest number 

013 52 034 | Put V into largest number cell 

014 44 030 | Jump to stop 

015 42 035 | Clear accumulator 

016 53 031 | Put U into accumulator fa up comparison 

O17 54 033 | Form U — W in accumulator 

020 43 025 | If (acc) negative, W > U, W largest; if positive, W < U, 
U largest 

021 42 035 | Clear accumulator 

022 53 031 | Put U into accumulator |v is largest number 

023 52 034 | Put U into largest number cell 

024 44 030 | Jump to stop 

025 42 035 | Clear accumulator 

026 53 033 | Put W into accumulator \w is largest number 

027 52 034 | Put W into largest number cell 

030 60 000 | Stop computer 

031 U 

032 V }given numbers 

033 Ww 

034 Largest number cell 

035 00 000 | 0 constant 





One-plus-one-address Systems. There is another kind of two-address 
instruction that deserves mention. It is a two-address instruction where 
(he a and 6 addresses are retained, instead of the a and 6 addresses. In 
other words, an addition instruction might read; “Add (a) to the accu- 





a 
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mulator, and take the next instruction from 6.” Such a system has the 
coding characteristics of a one-address system ; the purpose of the second 
address is to allow minimum access coding when a drum is being used. 
That is, the second address presents some further flexibility in the placing 
of the instructions in the drum memory. This is often called a one-plus- 
one-address system. 


3-3. Decimal Systems 


Decimal-coded Binary. The only variations in the coding systems so 
far described were with respect to the instruction format and, in par- 
ticular, to the number of addresses that were explicitly displayed in these 


Tape 3-9. UNIVAC X-3 CopE 


Binary Binary Binary Binary 
Symbol code Symbol code Symbol code Symbol code 

0 00 0011 

1 00 0100 A 01 0100 Si 10 0100 

2 00 0101 B O1 0101 K 10 0101 Ss 11 0101 
3 00 0110 Cc O01 0110 L 10 0110 T 11 0110 
4 00 O111 D O11 O111 M 10 O111 U 11°«+OO11L 
5 00 1000 E 01. 1000 N10. 1000 V 11 1000 
6 00 1001 F Ol 1001 O 10 1001 W 11 1001 
7 00 1010 G Ol 1010 P- 10 1010 xX 11 = 1010 
8 00 1011 H 01 1011 Q 10 1011 Y 11 1011 
9 00 1100 I 01 1100 R 10 1100 Z 11 1100 


formats. In all these systems numbers were written in octal-coded binary; 
ie., a set of octal characters represented a binary number. In fact, the 
three bits represented by each octal character corresponded to the same 
number with radix 2: thus, 3 (octal) represented 011, 5 (octal) represented 
101, ete. The instructions were written with the same shorthand, the 
octal characters representing the real equivalent 3-bit binary numbers. 

There is another system for coding numbers and an associated short- 
hand for representing the bits in an instruction, called decimal-coded 
binary. Either octal-coded binary or decimal-coded binary can be used 
with four-, three-, two-, and one-address instruction formats. 

Usually, in decimal-coded binary, 6 bits are associated with a single 
decimal number. Since there are 64 different combinations that can be 
formed with 6 bits, the letters of the alphabet are also assigned binary 
codes. These assigned codes depend on the logical design of the come= 
puter. For example, one such system (UNIVAC) is as shown in Table 
3-9. 

In such decimal computers a decimal number is written into a word 
“as is,” with no conversion required, Of course the resulting binary 
number is not a number at all, but a binary code, The arithmetic unit 
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then handles 6 bits at a time, and is wired so as to produce the correct 
results. For example, in octal-coded binary we would have 


Corresponding binary operation 


Octal as it would appear in computer words 
5 000 101 
+ 6 +000 110 
13 001 O11 


In decimal-coded binary we would have 





Decimal 
5 000 011 001 000 
+ 6 +000 011 001 001 
11 000 100 000 100 


Note that a word that contains letters cannot be added to another 
word since addition is defined only for numbers. Also, an address must 
be denoted by pure numbers, not letters (for, as we shall see below, we 
must be able to add to or subtract from the addresses). The lattars casi 
be used only for the operation code or for some letter constant. 

lhe advantages of a decimal system are simply that no conversion 
need be made for numbers and that letters can be read into and out of 
the computer directly. However, the disadvantage of a decimal system 
in that it is difficult to write codes where the value of a particular single 
bit of a word has significance, as is necessary, for instance, in switch 
sensing ; while there are methods for determining the value of some bit 
ol the word, in general the bits are used in the computer in groups of six 

Kvample. As an illustration of a code in such a system, consider the 
(hree-address instructions of Table 3-10, where one decimal symbol is 
used for the operation code and three decimal digits are used for each 
address. Here (a) + (8)— y means “The contents of a@ are added to 
the contents of 8, and the sum is put into address y,” etc. 


TABLE 3-10. THREE-ADDRESS DecrmaL INSTRUCTIONS 








Code Operation Meaning 
A | Add (2) + @)> v 
M | Multiply (a): (B)> y 
S| Subtract (a) — (8) 
D_ | Divide (a) + (B)> v 








C | Compare 


If (a) > (8), take y 
P| Stop computer 


eee 
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The code for the first of our examples is shown in Table 3-11. 


LE 3-1 
TasieE 3-11. DEcrIMAL-CODED Binary VERSION OF Cops or TAB 







Instruction 


















































Remarks 

000 S Take first instruction from 001 
001 M Form Voyt 
002 M 010 010 013. | Form ? 
003 M 013 009 013 | Form gt? 
004 D 013 oll 013 | Form }égi? 
005 S 012 013 013. | Form Y 
006 P 000 000 000 | Stop computer 
007 
008 0 000 001 000 | Voy 
009 0 000 000 032 ig 
010 0 000 000 053 «it 
O11 0 000 000 002 | 2 

2 V oyt temporary : 
a | Temporary where final result is found 


3-4. Arithmetic, Logic, Decision, and In-Out Instruction Types 


In this and the following section we present a of a importa 
i i i d in modern digital com , 
kinds of instructions that are foun t a ae 
ject 1 cri ts upon which the various Kin 
object is to describe the concep ae 
i fie details of the instructions 
tions are based, rather than speci nee 
those which are most commonly d. 
The concepts are chosen from — my 
i ; ts may take many dilleren 
Instructions based on these concep diff i i 
the many different computers. For purposes of povine ee a 
i ins i ts have been classified into live , 
tation, these instruction concepts © ee: 
ies: i ¢ instructions; (2) logical an 
categories: (1) arithmetic instruc Ce ccracta 
i ions; ision 1 tions; (4) read and wrae 1 ' 
instructions; (3) decision instruc : ( ae 
i iding 1 tions. Although there is some overlap , 
(5) recursion-aiding instruc A 
it 1 i i h instruction as falling int ‘ 
it is most convenient to consider eac a 
i 1 through 4 are considere 
ly one, of these categories. Categories ee 
ie section; category 5 is considered after the discussion of addressing 
in the next section. , 
. Floating Point. One of the important concepts that appears In a 
metic instructions is the automatic floating decimal point. : As ro eae 
some of the above examples and problems, in ordinary arithmetic a 
reer the coder was limited to handling numbers within the range of t 














Sic. 3-4] INSTRUCTION FORMATS AND OPERATION TYPES 99 


word length. For example, if the word length is 15 bits plus a sign bit, 
and if all numbers are interpreted as being nonfractional, that is, if the 
binary point is considered to be on the right, then only integers that lie 
between 0 and 2'5, or —1 and —2", can be handled by the computer. 
Obviously there must be some way out of such a totally unacceptable 
situation. One method is to multiply all numbers involved in a code 
hy an appropriate scaling factor so that they fall within the desired range 
(luring the computer computations. Of course the results put out by the 
computer must be readjusted to their actual values. However, it suf- 
lices to mention here that scaling presents more work for the coder and is 
i disadvantage. Many computers avoid this disadvantage by having a 
no-called floating point for arithmetic operations. 

In computers with floating-point arithmetic operations, a number is 
inemorized in a word composed of two parts: a mantissa, which contains 
the significant figures of the number, and an exponent. For example, one 
computer that has a 45-bit word uses 36 bits for the significant figures 
and their sign and 9 bits for the exponent and its sign. Hence this 
vomputer can handle numbers whose absolute values are as large as 
(2"° — 1) & 255 (since 2° = 256) or as small as 2—*55—that is, numbers 
Within this range with no more than 385 significant bits. A range such 
lis this covers most cases that usually occur and hence relieves the coder 
of all sealing considerations. 

The terms normalized and unnormalized are often used in connection 
with floating-point arithmetic operations. These terms are concerned 
with the problem of determining the appropriate exponent of the result 
of an operation. The result of an operation is called normalized if the 
exponent is so adjusted that the leftmost bit of the mantissa is a unit 
hit. This of course maximizes the number of significant figures that 
wun be recorded in a word. A computer operation that presents nor- 
iulized results is called normalized; e.g., we would say add-normalized. 
Aw an example, suppose that a computer had a 24-bit (8 octal position) 
mantissa and we multiply-normalized the (octal) numbers (with octal 
exponent): .00033333 X 2° by .04444444 & 2%. The unnormalized result 
(double length) is .0000 1753 0442 4714 « 2". Of course this is a 
hinary number in the computer, and the problem is to determine that 
exponent which will move the binary number to the left until a unit bit 
ippears in the leftmost position. The number of bit positions the man- 
(inna will have to be moved to the left in our example is 3 X 4 + 2 
{where 3 X 4 is the number of bit positions corresponding to four zeros; 
and since 1 (octal) is OO1 (benary), a move two further positions to the 
loft is necessary]. Figure 3-1 illustrates the binary and octal forms of the 
double-length mantissa before and after the move. 





a 
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The final normalized result will therefore be .7654 2212 x 2% [where 
11 — (3X 4+2) = —5]. Of course the computer will do oe a 
matically and will display only the “single-length” result. (See aS 7 
further discussion of single- and double-length results of multip ie a 
Note that, if the computer binary point is understood to be to the right, 
the above multiplication would be 


00033333 X 2-22 x 04444444 X 27° = 00001753 04424714 x 2-7 


lions: e.g., in a three-address system, if no overflow, then take next 
instruction; if overflow, take y. 

9, Multeply-add. In a two-address system, form (a) - (acc) + (8), and 
put result into accumulator. 

10. Partial add-a address. .In a two or more address system, only the 
w portion of the words is added. Similarly there are add-8, -y address 
instructions. 

8. SUBTRACT. The kinds of subtract instructions are analogous to the 
kinds of add instructions but would also encompass: 

\1. Change sign of register. This is not a subtract instruction but only 
vhanges the sign of the accumulator. 

C., MULTIPLY. Before describing the kinds of multiply instructions we 
Must first describe briefly how multiplication takes place in an arithmetic 
nit. First note that, if an n-digit is multiplied by an m-digit number, 
ihe result can have as many as n + m digits. In a computer, therefore, 
the result of a multiplication can contain twice as many bits as appear 
in « word. Hence there are usually two registers in the arithmetic unit 
(or one double-word-length register) to record the result of the multipli- 
jution. If a word has m bits, then the least significant m bits (which are 
found in one of the registers, called the minor register) are called the minor 
multiplication result, while the rest of the bits are called the major multi- 
plication result (and are found in the other register, called the major 
peqister). 

lor example, if we consider a 24-bit (8 octal position) word, and multi- 
ply 00033333 by 04444444, we obtain as a result 00001753 04424714. 
Here 00001753 is the major product, and 04424714 is the minor product. 

|, Regular multiplication. Leaving both major and minor results in 
(he registers. 

2, Major multiplication—unrounded. The memorized result is the 
inujor multiplication result. 

4, Major multiplication—rounded. The memorized result is the major 
inultiplication result with a least significant unit added to the major 
tosult if the most significant bit of the minor multiplication result is a 
wnit, If the multiplication is normalized also, the normalization is done 
lirat, then the rounding. 

|, Floating multiplication—unnormalized. The exponents are added, 
und the mantissas are multiplied. 

5. Floating multiplication—normalized. After the multiplication the 
product is shifted and the sum of the exponents adjusted so that the most 
siwnificant bit is a unit. 

, Rounding only. This is not a multiplication instruction, but the 
umber in the major register is rounded with respect to the number in 
the minor register, 


If the product were normalized by shifting left, dropping the last —- 
places would be equivalent to multiplying by 2~‘° and the exponent wou 


1 4 Before 
0 0 0 0 1 7 5 3 0 4 4 2 4 7 “a 
000 000 000 000 001 111 101 011 000 100 100 010,100 111 001 100 move 


Q00 000 After 
111 110° 101 160°010 010° 001 010 011 100° 110 000 000 000 ‘a 
7 6 5 4 2 2 1 2 3 4 6 0 0 9 0 QO move 


Fic. 3-1, Normalized and unnormalized mantissa. 


have to be corrected accordingly: —47 — [(3 X 4) + 2] + 30 = —35, 
ivi y 89 as sult. 
iving 7654 2212 X 10~*° as the resu 

: When an unnormalized operation is defined, the method for determin 

ing the exponent must be clearly stated. ae 
aaene Instruction Types. We can now turn +0 a description 0) 

the kinds of arithmetic instructions. Consider ‘‘add first. 

A. App. 1. Ordinary addition instruction, as described in previou 

sections. | 

2. Floating addition—unnormalized. The exponent of the result wi 
ds. 

be that of the largest exponent of the two operan 

3. Floating addition—normalized. The exponent of the result will 
adjusted so that the most significant bit of the number is a unit. 

4. Absolute-value “add.” In a two-, three-, or four-address syste 
the result would be = |(a)| + |(8)\- 

5. Add into accumulator. In a one-address system (acc) + (a) is pu 
into the accumulator. 

6. Add into memory. Ina one-address system (acc) + (a) is put into 

7. Add relative. The instruction code itself tells that the address 
to be considered “relative” to some “relative counter (see Sec. 3-5). 

8. Add with overflow check. If the sum is greater than the largest nu 
ber that can be stored in a word, then overflow is said to have oceurre 

If overflow occurs, then the computer is to break the sequence of instru 
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p. pivipg. In dividing, the quotient appears in the major register, the 
remainder in the minor register. This brings up some variations in the 
divide instruction. 

. Quotient divide. The quotient is the memorized result. 

_ Remainder divide. The remainder is the memorized result. 
. Quotient divide—rounded. 

. Floating divide—unnormalized. 

. Floating divide—normalized. 

_ Divide with overflow check on quotient. 

Logical and Bit-handling Instruction Types. These instructions are 
used for two main purposes. First they are used to modify instruc 
tions; second they are used when it is desired to find whether the bit in 
a Pi position is zero or a unit, as is often necessary when bits represent 
switch inputs in a machine. 

A. LOGICAL ARITHMETIC. 1. Logical addition. There is no carry, ang 
the bits in the corresponding positions of the two operands are ‘‘added 
according to the rule 1 |+ 1 = 1,1 |+0=1,0|+41= 1,0|+0=0 


For example, 


aoorwnn 


0101 A 
Logical + 0011 B 
Oll1 A ee B 


which can, of course, be generalized to a word of any number of bits i 
length. ‘ 

2. Logical multiplication. Similar to logical addition except tha 
1[-1=1,1[0=0,0|, 1=0,0|-0=0. For example, 


0101. A 
Logical X 0011 B 
0001 «= A |: B 


which can be generalized as above. 
3. Logical ring add (or logical inequality). 
except that 1/#1=0, 1|#0=1, O0|#1= 


example, 


Similar to logical addition 
1,0|40=0. Fo 


0101 
0011 


0110 


A 

Logical ring + B 

A|¥ B 
which can be generalized as above. 

4. Logical compare (or logical equality). 

except that 1|= 1=1, O|= O= 1, | |= 0 = 0, 


Similar to logical additio 
Olmi=0. F 










Suc. 3-4] INSTRUCTION FORMATS AND OPERATION TYPES 103 
oxumple, 
Lagieal 0101 A 
ogical compare 0011 B 
1001 A =B 


which can be generalized as above. 

5. Complement. Units are changed to zeros, and zeros to units; e.g., 
i! (w) = 0101, the complement of (a) is 1010. 

(. Hextract. For a four- or three-address instruction, where a bit of 
(w) is a unit, replace the corresponding bit of (vy) with the corresponding 
bits of (8). For example, 





(a) 000 411 I 000 O11 

(8) 010 110 1 

(y) 011 ( 101{ 110 010 101 
Result, found in y 011 “001 “101 010 110 


7. Odd-even extractor (used for decimal computers). Replace those 
ligits of (vy) which correspond (in position) to odd digits of (a) with the 
eorresponding digits of (8). For example, 





(@) 222 555 793 246 277 
(6 134 765.2 
(vy) 867 { 6 243 876 
Result 867 “682 \304 243 810 
i, SHIFT INSTRUCTIONS. 1. Right shift. The whole (a) is shifted to 
the right by the number of positions specified in the instruction. If the 
shift is m positions, the effect is the same as multiplying by 2-". Those 


hits which run off the right end of the word are lost; those which come 
jito the left end are zeros. 

2, Left shift. Like the right shift, but in the other direction, i.e., like 
Multiplying by 2”. 

4, Circular shifts. Shifts where the bits that go off one end come back 
al the other as if the ends of the words were connected together. (These 
ave also called “end-around shifts.’’) 

|, Left significant shift. Shift till there is a unit in the most significant 
pluce. This differs from the above shift instructions in that the number 
ol positions to be shifted is not given by the instruction but is determined 
(luring the operation. The number of positions shifted is often recorded 
i 8; the shift is performed in the accumulator for a one-address system. 

Decision Instruction Types. When a flow chart of a program comes to 
4 branch point, some form of decision instruction must be used to deter- 
iiine which branch should be taken. As far as the segmented execution 
of (he instructions in a program is concerned, a branch point means that 





a 
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the program either will continue on normally or will jump to another 
part of the program. The term jump is used to indicate that at such a 
branch point the next instruction is taken from some part of the memory 


other than the usual sequence. 
A. COMPARISON OF worDs. 1. Usual comparisons. If (a) > (8), then 


take y for four- or three-address computer; if (a) > (acc), take 6 for 


two-address computers. 
2. Absolute comparison. Same as 1, except that the absolute values o 


the words are compared. 


3. Negative jump. If (acc) is negative, take a. 

4. Positive jump. If (ace) is positive, take a. 

5. Zero jump. If (acc) = 0, take a. 

6. Nonzero jump. If (acc) # 0, take a. 

7. High jump. If (ace) > (a), skip next instruction. 

8. Low jump. If (acc) < (a), skip next instruction. 

9. Equal jump. If (ace) = (a), skip next instruction. 

10. Least significant jump. If least significant bit of (acc) is 1, then 
jump to a. 


Of course items 3 to 10 are for one-address systems; in items 7 to | 
the number of instructions skipped may be more than one but would 
be fixed for a given machine. 

B. SENSING OF SIGNAL SwitcHES. 1. Special signal transfer. 
tain signal specified by the operation is on, then take a. This instruc 
tion is used in conjunction with an instruction that turns on the signal 
This latter instruction may be separated in the program from the special 
signal-transfer instruction so that the jump occurs some aaa th 
signal was turned on. There may be several signals and associated set 
of instructions. 

2. Manual jump. Here a jump will occur only when | a manu 
external switch has been set before the program reaches the instruction, 

3 Manual halt and transfer. The computer will stop if the extern 
switch specified by the instruction has been turned on. Then, when th 
start button is pushed, the jump to @ will occur. If the switch is off 
the computer jumps without stopping. ; 

4. Return jump. In a two-address system the program jumps to (8 
for the next instruction; at the same time the contents of the curren 
instruction register (i.e., the address of this particular return-jum 
instruction) is recorded as (a). The purpose of this instruction is tha 
it is often desirable to jump back to the place from which the origin 

ogram jumped. oe: 
aes ae ae As was pointed out above, the “sign” bit is 
auxiliary bit. The sign bit can be given meaning for instructions 
follows: if it is 0, the computer continues to compute as usual; if it is 1 
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then the computer halts. Important use is made of such a system in 
detecting errors in a program. Suppose that a coder were running a 
program on a computer for the first time; he might want to stop the 
computer at intermediate stages in the program to see whether or not 
the code were correct up to those points. In order to do this, he would 
put 1 in the sign bits of those instructions at which he wants to stop the 
computer so that he could examine the partial results. An alternative 
to this procedure, which serves the same purpose, is to have the computer 
(uke the next instruction from a standard, predetermined location in the 
memory whenever a 1 appears in the sign bit of an instruction. At the 
standard position a special code might be located that can do various 
things to aid the coder detect errors and correct his code (see Sec. 5-2). 

Read and Write Instruction Types. These instructions operate the 
input and output electromechanical units. 

A. MAGNETIC TAPES. These instructions can enable the computer to 
move the magnetic tapes forward or backward, or completely to rewind 
them. They enable the computer to read from the magnetic tapes into 
the higher-speed memory. The number of words written onto or read 
from the magnetic tape is specified either in terms of individual words 
or, more usually, in terms of the number of blocks of words, where by a 
block of words we mean some predetermined number of words. If, for 
oxumple, there are eight words in a block, then only multiples of eight 
words can be handled. Similarly, in just moving the magnetic tapes, the 
(listance to be moved is specified in either single words or blocks of words. 
The instruction code tells whether to read or write, or move the tapes 
backward or forward; the number a tells how many words or blocks are 
to be considered. For reading and writing, the number £ is the address 
(hat the first word is read into or written out of. 

li, PUNCHED TAPES AND CARDS. Instructions that tell the machine to 
road in from paper tape or to punch paper tape either give the initial 
and final addresses to be read into or out of or else give the first. address 
and the number of words to be considered. Sometimes in reading-in 
paper tape, only the initial address is given, and a signal is put on the 
paper tape to tell the computer when to stop reading. Punched-card 
input-output is similar. 

«, bispLAY. For displays on an oscilloscope each instruction may tell 
the display to exhibit one dot or other geometrical pattern, and the 
ljnatruction gives the coordinates of that dot or of some point on: the 
pattern, 

), CONCURRENT IN-OUT. Some computers have concurrent input and 
oulput, In these cases the computer can compute at the same time that 
words are being read into or out of the computer. 

In addition it should be mentioned that some computers have such 





——— 
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three operational words are equal to one memory word, the instructio 
a nd its two arguments could be all read into the contputing unit at ae) 
lor more flexibility, the number of operational words to be accessed at 
the same time can be put under the control of the programmer. In 
effect, we are then fixing the length of the operational word and all a 
(he memory-word length to vary. es 
A closely allied capability is to place the length of the word to b 
accessed under the control of the programmer. In this way, variabl : 
length business records, files, and other items can be sae ciently 


complicated methods of reading in and out that special codes must be 
written to facilitate these processes, and these codes must be available at 
all times inside the computer’s memory. Often these codes must be put 
into the computer at first by hand methods, in a sort of bootstrap oper- 
ation. Once the routines are in the computer, the operation may never 
need to be repeated. However, these methods are so peculiar to the par- 
ticular computer involved that it is not worthwhile considering them here. 
In the Appendix we have summarized some of the instructions on three 
typical, commercially available computers which employ many of the 
concepts discussed in this section. 
3-5. Indirect Addressing and Relative Indexing 


a2 Address 





Instructions can be defined to perform many noncomputational func- on 
. . —Wea° Ss 
tions for the purposes of extending the computer’s capabilities and. a3; 
a4 
Operational — 
Instruction | Maid 
register length Memory 
1 words 
; Operational Memory , 
First argument Comput ia, 3-8. Ilustrati i ; 
word word puter . Illustrating partial addressing. If n = 4 and the f aig 
realstet length length memory then the partial address for a1 = 1, a2 = | would access fie ra cane r anes 
1100, 1101, 1110, 1111, shown bracketed. alia ats 
Operational 
Second argument . 17 : 
register beat raaerets in the memory. Here the length of some component of the 
operat . 
oo a )perational word, say a character, is fixed, and the number of such char- 








uclers is allowed to vary, resulting in variable-length words. 

\, PARTIAL ADDRESSING. Varying the number of words to be accessed 
int he memory can be accomplished by using a shortened address. Thus 
wonsider a 2"-word memory, where each word is identified by an n-bit 
address: an (n — 1)-bit address will identify one of 2”! = 2”/2 pairs of 
words; or in general a partial (n — p)-bit address will identify one of 
ur"? = 2/2? groups of 2? words. For illustration, if n = 4 (see Fig. 3-3) 
then the partial address 11, say, would access the four adjacent addresses 


Frc. 3-2. Illustrating “memory-word’”’ length being a multiple of the “operation: 
word” length. Hence in this case a single memory-word access produces three oper’ 


tional words for the computation unit. 


enhancing the programmer’s convenience. For the former purpose 
variable word addressing and indirect addressing are utilized; for th 
latter purposes, relative indexing and special recursion-aiding instru 
tions are utilized. In this section we consider the principles upon whie 
such instructions are based. 

Variable Word Addressing. In currently available computers, t 
memory access time is generally longer than the instruction executio 
time. To take fuller advantage of the high computational speed offere' 
by modern computers, it therefore often becomes desirable to selee 
several words from the memory simultaneously, or at least during t 
same phase. The words thus chosen are read out of the computer memor 
into high-speed registers available in the computing unit. One techniqu 
for accomplishing this is to have one ‘memory word’? really equivalen 
to several “operational words’’ (see ig, 3-2). Thus, for example, 


1100 1101 1110 1111 


Partial addressing can be incorporated into the instruction format of a 
decimal computer by filling in the blank part of the partial address with 
i letter character. For a binary computer, a convention might be made 
(hat every address is to have a unit as its most significant bit; then the 
(isplacement of this unit to the right would determine the amber of bit 
omitted in the partial address. = 
HW, OVERADDRESSING. Varying the length of the accessed word involves 
the addressing of a component of a word, say a 6-bit character. A special 
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character is utilized to define the end of a word. Then to address a word, 
only the address of its first character is used; as the word is read into the 
computing unit sequentially, one character at a time, the end-of-word 
character is sensed, stopping the memory access process (see Fig. 3-4). 
Since each character is being addressed, the number of bits in the address 
will be more than is required by the number of words that could be 
formed in the memory (except in the case where all the words are chosen 
to be one character long)—hence the term overaddressing. This method 
truly represents a variable word length, but the price paid is an address 
with sufficient bits to address the shortest possible word. Also, special 
instructions must be made available to automatically lengthen, shorten, 


or Oo 


gqgegaegesida 
=e NN OY 


mon oraya®g 9 
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Fig. 3-4. Illustrating overaddressing. The figure shows the address (in, say, octal) 
of each character in the memory, where the crosshatched characters are end-of-word 
characters. The address of the first word is 00, and it contains five characters; the 
address of the second word is 06, and it contains three characters; the address of the 
third word is 12, and it contains seven characters; and so forth. 













or initiate words by moving, omitting, or inserting end-of-word charac- 
ters at specified locations. There are many variations as well as combi- 
nations of these methods. For example, by shortening the component 
or character address, we might make the computer read larger (or multi- 
character) components from the memory, until the end-of-word signal 
character appears. 

Indirect and Augmented Addressing. For very large memories, the 
number of bits required for an address may be more than the number 
relegated to the address part of an instruction. In such a case, the large 
memory can be accommodated by having the instruction directly address 
not the desired operand, but rather another word whose entire contents 
will be the address of the operand. Generally the “other word” is & 
high-speed register reserved for the purpose. If there are 2” such regis« 
ters, then r bits are required to address one of them; this is called proper 
indirect addressing. A similar technique is to have the instruction indi- 
cate only part of the address of the desired operand, and also the address 
of a register the contents of which will complete the desired address. This 
augmented addressing can be done on either the most significant or the 
least significant end of the indicated address. In either case, the moat 
significant part of the resulting address is interpreted as denoting the 
part, or area, of the memory to be accessed, and the least significant pa 
gives the address of the desired word within that area, If there are 2 
high-speed registers, each of contents R bits, 2" memory words, and ¢ bit 
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in the instruction address, then in the former case we would need 7 > r 
and R > m for addressing ‘“‘gain,’’ and in the latter case would need pia 
in the instruction for addressing the register, where r +7 < R +7 and 
R+7>m. 

_ A, INDIRECT ADDRESSING. Consider as an example a three-address 
instruetion format with a 27-bit (9 octal position) word. Suppose three 
octal positions are used for the operation, leaving six for the three 
addresses, or two octal positions per address. This will enable a maxi- 
mum-sized memory of 2°, or 64, words. However, if it is desired to 
extend the memory size, indirect addressing must be utilized. Suppose 


Memory-word selection gates 





Register 


Instruction 
Indirect addressing Augmented addressing 


Ka, 3-5. Diagrammatic illustration of indirect and augmented addressing. An arrow 
arising from inside a rectangle indicates that its contents is utilized; an arrow directed 
(o the side of a rectangle indicates the designation of the address of the register. 


it were desired to have 64,000 words for the memory, requiring a 16-bit 
address. To accomplish this, the instruction addresses could refer to 
one of up to 64 indirect-address registers, each of which itself consists of 
16 bits. Then, previous to the execution of the instruction, the appro- 
priate registers would be loaded with the desired 16-bit adressen: When 
being executed, the instruction would refer to the addresses of the regis- 
(ers whose contents are the desired memory addresses; the computer 
would automatically access these desired memory addresses (see Fig. 3-5). 

i}. AUGMENTED ADDRESSING. Consider as an example a one-address 
instruction format with an 18-bit (6 octal position) word. Suppose that 
‘wo octal positions are used for the operation, leaving only four for the 
address, and that 250,000 words comprise the total memory. Augmented 
addressing would then be required. An augmenting register of 6 bits 
would be preloaded, and the desired address would be interpreted by the 
vomputer as being the 6 bits of the register taken together, or augmented 
with the 12 bits of the instruction. If the register address bits are ean 
sidered to be the most significant bits, then the contents of the register 
enn be thought of as denoting which of 64 possible memory banks is 
being utilized, each such memory bank containing 4,096 words (see Fig. 
5), Such a system might be used, for instance, in conjunction with an 
overaddressing technique, as described above, 
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Relative and Compound Addressing. ‘The previously discussed methods 
of this section had as their aim the extension of the computer’s capabilities. 
We now turn to techniques designed to aid the programmer. Relative 
and compound addressing are primarily aids to instruction modification. 

A. RELATIVE ADDRESSING. Relative addresses are associated with 
so-called index registers. An index register is a very special memory cell 
or register that can contain only a single address; i.e., it is a “short” 
memory cell. The computer must have special instructions that can put 
a number into such a register, add to the number in the register, etc. 

An address appearing in an instruction is said to be interpreted “rela- 
tive” to a register if the computer considers, instead of the contents of 
the address indicated, the contents of the word whose address is the swum 
of the index-register contents and the indicated address. For example, if 
the contents of the index register were 002 and the address in the instruc- 
tion were 053, then the computer would consider the contents of 


002 + 053 = 055 


In cases where many index registers are available, the computer instruc- 
tion word will contain, in addition to the indicated address, the address 
of the index register relative to which the indicated address is to be con- 
sidered (see Fig. 3-6). 

The use of such relative addresses is to aid the coding of iterations, 
or loops. As illustrated by some of the problems of the previous chapter, 
each time around a loop it is often necessary to increase some addresses 
of the main set of instructions by 1 (or some other integer). In such 
cases a set of instructions modifying the instruction must precede the 
main set of instructions. But with relative addresses all that need be 
done is to increase the index-register contents by 1 (or whatever is the 
necessary integer for a particular case) by means of a “register-setting” 
instruction. One variation of this system is to have several such index 
registers, which can be used in many different ways. For example, all 
instruction preceding the use of relative addresses can determine to which 
register the addresses are to be considered relative. Or the addresses 
may be considered relative to the sum or some other function of the 
numbers in the registers. 

Relative addressing can be considered as the fundamental concept of 
this section. Then augmented addressing can be interpreted as having 
some positions in the index register correspond to zeros or unrepresented 
bits in the direct address of the instruction itself. Finally, indireet 
addressing can be thought of as being the extreme case, in which t 
direct address is null, i.e., all zeros, and does not actually appear. 

B. COMPOUND ADpRESSING. For greater flexibility, indirect addres 
ing, augmented addressing, and relative addressing can be combined 
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lor example, the indirect address could be augmented, which would 
effectively enable the combining of two registers. Or an indirect address 
could be made relative to the contents of an index register. Another 
combination could be to have the instruction address a register for the 
basic indirect address, and also to address two index registers for relative 
addresses. The desired address then becomes the sum of the indirect 
address and the contents of each index register. By incrementing the 
index registers, a doubly indexed array can easily be stepped through. 
Of course, many such combinations can be made. Finally, each memory 
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Instruction 


Relative addressing 


Compound addressing 


Mia, 3-6. Diagrammatic illustration of relative and compound addressing. An arrow 
arising from inside a rectangle indicates that its contents is utilized; an arrow directed 
to the side of a rectangle indicates the designation of the address of the register. 
In the compound-addressing case, the use of register X may be considered augmented 
addressing, for choosing the proper memory bank, while the use of Y and Z may be 
vonsidered relative addressing for double-indexing purposes. 


word could be made available as an index register. Here we could have 
an address in an instruction word refer to the address of another word 
whose contents will contain the address of an index register (see Fig. 3-6). 

The use of such index registers is associated with four functions: 
(1) address modification, (2) incrementing the index register, (3) index 
or tally testing, and (4) the initial setting of the contents of the index 
register, tally, and increment. The address modification is the forming 
of the desired address as the sum of the index-register contents and the 
indicated address; the incrementing of the index-register contents is fre- 
(uently accomplished by utilizing a counter for the register itself. The 
use of lists and loops requires a tally that signifies the end of the list and 
the exit for the loop; the tally condition is determined by a comparison 
hetween the contents of the index register and some tally constant, which 
may be zero, some number in the instruction, or a number in some other 
word, Special instructions must be defined to set the initial contents of 
the index register, to specify the increment when it is a variable, and so 
forth, These will now be discussed. 
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Recursion, or Loop-aiding, Instructions. These instructions are de- 
signed to aid in the programming of loops. In the examples and exer- 
cises given in this and the previous chapter, each time around the loop 
the main set of instructions usually needed to be modified by means of 
additional instructions. As was seen above, index registers can aid in 
eliminating these additional instructions. Also, loop codes require that 
a tally be kept to allow the computer to jump out of the loop. The 
purpose of recursion, or loop-aiding, instructions is to facilitate the use 
of the index registers and to aid in keeping the tally automatically. 

A, INDEX-REGISTER MODIFIER. Adds 1 to the index register and jumps 
to a. Here the loop tally is not considered. 

B. FILE AND MODIFY INDEX REGISTER. This instruction stores the con- 
tents of one of the index registers (assuming that there are several such 
registers) as (a); then another register designated by the instruction is 
set to the number 6 (that is, not the contents of 6, but 6 itself), and this 
register is then considered the one to be used until another such /ile 
instruction occurs. Since resetting and going to another register essen- 
tially constitutes a jump, this instruction can act as a return jump, 
However, if the number £ of the file instruction itself is considered rela- 
tive to the same register that the file instruction is setting, then this regis- 
ter is set to its original reading plus 8; that is, the register is increased by 8. 
As we have seen in the index-type instructions, resetting a register is an 
aid to recursion codes; however, no tally is considered. 

C. TALLY guMP. Subtracts 1 from the number a@ of the instruction; 
if the result (that is, a — 1) is positive, replaces the number a by the ~ 
number a — 1 (in the a-address position) of the instruction, and jumps 
to take as the next instruction the contents of 8; otherwise the program 
continues sequentially as usual. This instruction does not consider the 
index registers. 

D. MODIFY INDEX REGISTER AND TALLY. Compares the contents of the 
index register with the number a. If the contents of the index register is 
greater than the number a, then the number a is subtracted from the 
index register and the difference is placed in the index register. Then 
a jump is made to 6. Otherwise the program continues in sequence, — 
Note that the index-register contents acts both as tally and as a relative 
address. Of course this must be used in conjunction with an instruction 
that initially sets the contents of the index register. 

In the Appendix we have summarized the use of indirect addressing: 
and relative indexing in some commercially available computers. 


3-6. Special Coding Techniques 


Multiway Branch. Our instruction systems described above allow for 
only a two-way branch in a code, That is, by means of a comparison 
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instruction, the computer can determine along which of two possible 
directions to continue in the code. However, it often occurs that a 
many-way branch is needed in a code; this must be programmed. Just 
us the decision process for a two-way branch was reduced to a comparison 
of numbers, so a many-way branch may be reduced to comparisons of 
numbers. For example, if a code is to have a four-way branch, then, to 
determine which branch is taken in any case, the code might determine 
« number N which can take only the values 1, 2, 3, and 4. Thus one 
way of coding the four-way branch would be a split tree of successive 
comparisons, as in Fig. 3-7. 


Code (calculates N) 





1st branch 2d branch 3d branch 4th branch 


Fig. 3-7. Split tree of comparisons for multiway branch. 


There is another, more efficient. method for coding a many-way branch, 
ising a so-called jump table. Here the number N that decides which 
branch the computer should take is used as an instruction modifier (see 
lig. 3-8). The basic jump instruction is modified so that, when it is 
executed, the computer will jump to the appropriate branch jump instruc- 
lion of the jump table. This branch jump then jumps to the proper 
branch of the code. 

Switch Sensing. In real-time applications of digital computers (see 
Nec, 1-3), it is often convenient to connect the outside world to the com- 
puter by means of a battery of switches. These switches are attached 
(0 a very specially built word in the computer’s memory, and each switch 
controls a single bit of this word. If a switch is on, then the computer 
reads a unit in the position of the word corresponding to this switch; 
i! the switch is off, then the computer reads a zero. Thus at any time 
(his special word gives the status of the corresponding battery of switches. 
let us in addition assume that this special word behaves the same as 
any other word of the computer’s memory, except that the computer 
cannot read into it, 
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There are two ways of interpreting these switches, called on-off switch 
sensing and one-shot switch sensing. For on-off switch sensing the com- 
puter is to determine for each switch 
whether it is on or off. The nth switch 
1 can be sensed in this way as follows: 
J Shift the switch word left until the nth 


bit is in the most significant place; then 
compare this shifted word with a word 
Reset basic jump instruction to 160 | 4 ¢ aj] units except in the most significant 
Modify basic jump instruction to 
jump to 160 + N 
Execute basic jump to 160 + N 


place. For example, consider a one- 
The jump table 


address system with the following shaft 


instruction: “Shift (acc) left the number 
Jump to 1st branch 


Code (calculates N) 








of positions «.”” Thus suppose that the 
switch word were 011 110 101 110 001, 
and that we wanted to determin 
whether P8 (the eighth bit from th 
right) were 0 or 1. We would place t i 
word into the accumulator and perform 
Shift left 7— i.e., shift the word lef 
seven positions. For the comparison, 
negative jump would be used after su 
tracting (acc) from O11 111 111 111 
111. If the result is negative, then th 
shifted word had a unit in the most si 
nificant position, that is, P8 = 1; if th 
result is zero or positive, P8 = 0. 

For “one-shot”? switch sensing, th 
computer senses the switch word peri 
odically at a rapid rate. The computer is only to determine whic 
switches have been turned on. The one-shot switch is not to be sense 
when it is either left on or turned off. Thus one-shot switches are sense 









161 
162 
163 
164 
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Jump to 2d branch 
Jump to 3d branch 
Jump to 4th branch 






Fic. 3-8. Jump table for multiway 
branch. 


TasBLe 3-12. WHEN OnE-sHort SwitcHes ARE SENSED 


a 





New information | Old information Rented 
about switch about switch 
0 0 No 
1 0 Yes 
0 1 No 
1 1 No 








ee el 


according to Table 3-12. To code this, we can use the extract instru 
tion, The old information is extracted with the new onto all units; t 






















Src. 3-6] INSTRUCTION FORMATS AND OPERATION TYPES 115 


in the resulting word a zero will occur in the position corresponding to the 
switch that was just turned on. Hence, each time the switching word is 
sensed, it will be recorded in some other memory location. Then both 
the previous and the present switching word will be available at all times. 
‘lo see why the extract instruction works, consider the following: 


New word (a) 0101 


Old word (8) 0 (0) 1) 
(104 1 
1 *O ‘L 


e 


1 
1 
1 


Ilence only when 0 in the old word corresponds to a 1 in the new word 
will a 0 result. Next, of course, the position of this zero must be deter- 
mined. This can be accomplished by shifting left one position at a time, 
keeping track of the number of times shifted, until a zero is sensed in the 
inost significant position. 


Input-Output. Very often the input-output instructions of a computer 


are not adequate to read in a whole code 
or read out all of a given portion of the Obtain first and last addresses 
memory. In such a case a subroutine 
Set initial condition on read- 
in (out) instruction 
Set tally condition on read- 
in (out) loop 







must be used for reading in and out of 

(he computer. For example, consider 

(he following read-in instruction for a 
onc-address computer: ‘Read one word 

into a.” If an entire code is to be read 

into suecessive addresses of the comput- 

er’s memory, then clearly a subroutine 

(hat iterates this instruction is needed. ; 

lor example, using such a subroutine, 

the following procedure might be fol- 

lowed in reading in a code, assuming 

(hat the subroutine is already in the 

computer’s memory (see Fig. 3-9): The 

lirst instruction of the code to be read in 

will be a jump to the read-in subroutine. F16- 3-9. Flow chart of input-out- 
: aes ‘i ‘i ‘i put routine. 

When this instruction is read into, say, 

uddress 000 and executed, then the computer will jump to the read-in 

subroutine, The read-in subroutine will read in the next two words of 

(he code; these words will contain the initial and final addresses of 

(he area in the computer’s memory where the code is to be put. Then 

(he subroutine loads the read-in instruction with the initial read-in 

wddress and the tally with the final read-in address, and the iteration 

of the read-in instruction begins. When the complete code has been 

road in, the subroutine will jump to the initial read-in address and com- 


ee a ee 


116 MACHINE LANGUAGES [CHap. 3 


putation will proceed. It is probably wise to have the computer stop 
before computation proceeds, to give the computer operator a chance to 
turn off the input equipment, etc. Then, when the run button is pushed 
(see the next section), the computations will proceed. 

Table Look-up. Consider the problem of coding a computer to look up 
a function table that has previously been stored in the computer’s mem- 
ory. That is, given a particular value of the independent variable, we 
desire a code that will select from the computer’s memory the correspond- 


ing value of the dependent variable 
[Address] Operation] @ | 6 _| as recorded in the table. 

ra] [= [ra 
pat 









metHop 1. We have already il- 
lustrated above (see Sec. 2-7 under 
Instruction Modification) one simple 
technique that can be used when the 
independent variable in the table 
takes on successive integer values. 

METHOD 2. Consider now the case 
of nonintegral values for the inde- 
pendent variable, but where the in- 
tervals between successive values of 
the independent variable given in the 
table are all equal. In this case, 
given the value of the independent 
variable for which the value of the function is desired, simply divide it 
by the interval to obtain an integer; thence this case reduces to the one 
considered in method 1. 


Fic, 3-10. Setup of table for table 
look-up routine. 


The interesting case occurs when the values of the independent vari- 


able given in the table do not have equal intervals between them. Here, 
to be specific, suppose that we are in a two-address system and the table 
is recorded in successive cells of the computer’s memory as follows: The 


value of the independent variable is found in the « position of a word, | 


and the corresponding value of the dependent variable is found in the 
8 position of the same word (see Fig. 3-10). Let us call the successive 
values of the independent variable 0, 11, V2, . - «5 % where of course 
tw <t1<%<-°** <2y, and those of the dependent variable f(2o), 
f(ai), f(a), . - - » f(ws), and let us suppose that they are recorded in 
addresses T's, T1, Tz, ..., Ts. We desire to find f(N) for some given 
value N, assuming that N = 2; for some i; that is, no interpolation is 
considered here. 

MreTHOD 3. One simple procedure is to compare successively N :%o, 
N:a,, N:a2, ... , until, for some i, N = 2; then f(N) = f(x) can be 
read from the computer’s memory as desired. This, of course, can be 
accomplished by a simple iteration, 
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METHOD 4. There is another method that on the average will enable 
SN ) to be obtained faster. The flow chart for this method appears in 
Vig. 3-11. The method consists in determining in which half of the table 
x; = N is located; then the halving procedure is again applied to this 
smaller part of the table, ete., until a single cell is left. For example 
consider Fig. 3-10, and suppose that N is x;. Then the first step of this 


Start 
Let To=t,, Ty =ty 


ty-t 
A= = rounded 













A= wt rounded 











down to an integer up to an_ integer 


Fie, 3-11. Flow chart for table look-up, method 4. 


method is to determine that N is included in 7; through 7;._ The second 
step locates N further, as being in 7, or T;. The third step in this case 
locates N as being in 7. The code would be written as follows: At 
any step let tz, and ty be the respective lower and upper addresses of the 
range of addresses in the table under consideration. Weformt’ = ¢, +A 
the address that divides this range of address into two parts where 
A = (ty — tz)/2. Then we look to see whether WN lies in the upper or 
lower half of the range; i.e., is the contents of the @ part of t/ greater or 
less than or equal to NV, that is, ()a:N? If (). > N, then t’ becomes 
the upper address of the new range; that is, > ty. If (ta < N, then t/ 


118 MACHINE LANGUAGES [Cuap. 3 

becomes the lower address of the new range; that is, t! — tz. Of course, oe - 

if (t’)a = N, we have the desired result. There is one little complication, ©) (0) 38 2s 

however. When A = (tu — tz)/2 is formed, it may not be an integer; se 5 

but f’ = tz + A is an address and therefore must be an integer, hence % aes Ea 

must be rounded to an integer. A little reflection will show that, if & (CO) (0) Bea S32 
the nearest smaller integer to = ae 32 


(t’)a > N, then the next A should be 

(ty — tr)/2, while if (ta < N, the next A 

integer to (tu — tr)/2. 
For example, consider 


should be the nearest larger 


Final 
stop 
Stop 
button 


Fig. 3-10 in connection with the flow chart 
of Fig. 3-11, and suppose for argument’s sake that N = 2; and that 
T, = 100, 7: = 101, T; = 102,..., T, = 107. Then we would start 
with t, = 100, tv = 107, whence A = (107 — 100)/2 = 314, rounded 
up = 4. Hence ?’ = 100 + 4 = 104. Comparing (104).:N, we find 
N > (104).. Hence 104 is tz, and we form A = (107 — 104)/2 = 1, 
rounded up = 2. Thus ¢ = 104 + 2 = 106. Comparing (106)a:N, 
find N < (106). Hence 106 becomes ty. Thus A = (106 — 104)/2 = 1, 
whence ¢’ = 104 + 1 = 105. Comparing (105)a:N, find N = (105)a; 
and the result has been obtained. 


Start Run 
button —_ button 
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3-7. The Control Panel 


Purpose of the Control 
functions: (1) to enable t 


Panel. The control panel has three main 
he operation and monitoring of the computer 
during the computation of a program, (2) to aid the checking, or “debug- 
ging,” of a new program; and (3) to aid the maintenance of the computer 
by the engineer. The control panel and the program itself are the 
main communication links between the operator and the computer. By 
means of the control panel, the computer can be started and stopped, 
computations can be initiated, magnetic tapes can be properly positioned 
and other input-output equipment controlled, computer voltage and 
current levels, temperature, power input, etc., can be checked and 
adjusted, and so forth. There are wide variations in the features of con= 
trol panels of different computers. In this section we shall discuss some 
of the possible features that a control panel might have (see Fig. 3-12), 

In general, information is transmitted between the operator and 
computer by means of lights and buttons on the control panel. However, 
there is one method for monitoring a program during the computations 
using neither lights nor buttons that deserves special mention. It cone 
sists simply of a radio placed near the computer tuned to pick up signals 
generated during the computation. The computer, operating at a megie 
cycle or more, presents a radio frequency; recursion loops generate audi 
frequency modulations that can be detecte 

will correspond to the different sequences or loops of instructions in t 

program, <A coder who is computing many reruns of the same code, 
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the current-address counter reads 36363. 


Fig. 3-12. A simple control panel. 
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would be the case in computing mathematical tables, can frequently 
relate these sounds to the various iterations of his program. A trained 
ear can then often detect errors or other troubles that his computations 
may have, just by “listening” to the program while it is being computed. 

Lights and Signals. Usually one first notices the lights and signals 
displayed by the control panel. By means of these signals the computer 
can tell the operator something about its internal state. There are in 
general three kinds of lights and signals: the first kind tells why the 
computer stops during the course of a computation; the second presents a 
visual picture of the contents of various registers and memory cells; the 
third gives the electrical state of the computer. 

When the computer stops or halts, usually one of several lights comes 
on, indicating the cause for the stop; i.e., a stop instruction has been 
executed (called a final stop), or an instruction being executed may con- 
tain an erroneous operation code not meaningful to the computer, or an 
instruction being executed may have reference to an erroneous memory 
address not legal for the computer,t or a check stop due to an auxiliary 
bit may have occurred, and so forth. Control panels are often con- 
structed so that, when the computer has stopped or halted, the contents 
of various registers can be ascertained from sets of lights. For instance, 
there may be two small “grain-of-wheat” neon bulbs on the control panel 
corresponding to each bit of the accumulator. When the top bulb of the 
pair is on, the corresponding bit might be a unit; when the bottom bulb is 
on, the corresponding bit would be a zero. Similarly the contents of the 
instruction register of the current-address counter and of other registers 
and counters can be displayed. If an electronic failure has occurred, 
such as a blown fuse or a change in voltage or current readings beyond 
an allowable limit, appropriate signal lights will come on. Of course 
there will be a pilot light indicating that the power is on, etc. 

Buttons and Switches. The only means the computer operator has to 
control the computer, other than by the code itself, is by the buttons and 
switches on the control panel. Besides the on-off switch for the power 
there is a start button, which initiates the computations. For example, 
when the start button is pushed, the computer may automatically take 
as the first instruction the contents of address 000. There is also a stop 
button; when it is pushed, computation ceases. In addition to the start 
button there may be a run button, which is used to restart computations 
after a halt has occurred in the middle of a computation. In this case 
the difference between the start and run buttons is that the run button 
tells the computer to continue to perform the instruction already in the 
instruction register, whereas the start button tells the computer to initiate 


+ Frequently the full complement of memory address is not constructed for a- 


computer, Hence an erroneous instruction could refer to an address that the come 
puter does not have, 
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a particular standard predetermined sequence of events, e.g., take 000 as 
the next instruction. 

The computer can usually be operated in several different modes. 
lor example, by setting a switch the computer can be made to execute 
only a single instruction each time the start or run button is pushed. In 
this way a sequence of instructions can be “stepped through” slowly by 
successively pushing the run button. This would be done, for example, if 
one desired to see several of the instructions and their results on the lights 
of the control panel that display the contents of various registers. On the 
other hand, a different setting of this mode switch can make the computer 
sequence itself through the instructions of a program at a very much 
reduced rate. Finally, of course, the mode switch can be set so that the 
computations proceed at full speed. Often in the single instruction 
mode, there is a phase-selector switch which makes the computer proceed 
through just one phase each time the run button is pushed. 

So much for the special methods of starting the computer; consider 
hext special methods for stopping or halting the computer. In previous 
sections we have observed how the operation of manual switches can be 
used to halt the computation when auxiliary bits are present. These 
awitches are called auxiliary switches. There can be several auxiliary 
switches, each associated with a different auxiliary bit of the instruction 
word. Then, by setting the proper auxiliary switch, the computer can be 
made to stop at different sets of instructions. Other switches may make 
the computer halt at a particular phase when the stop button is pushed. 

One of the most important uses of a control panel is to insert words 

manually into particular memory locations. Such an occasion may arise, 
lor example, when it is necessary to correct only one or two words in a long 
program that is already in the computer’s memory. There are several 
methods used to accomplish this. One method is to have one button 
associated with each bit of the registers that may be displayed by lights 
on the control panel. When the computer has been halted, the contents 
of a register may be changed by means of these buttons. For example, 
there may be one “clear” button for each register that, when pushed, 
nukes the contents of the respective register all zeros. Then the bit 
huttons of this register are pushed to insert units in the desired bit loca- 
lions. Under these conditions the procedure for inserting a word into a 
particular memory position would be as follows: Halt the computer; 
imunually insert the desired word in the accumulator; similarly insert an 
instruction in the instruction register that will transfer the contents of the 
accumulator into the desired memory location. Set the mode switch 
for a single instruction, and push the run button. The inserted instruc- 
tion will then be executed, and the word will be inserted into the desired 
memory location, Other methods might make use of the external input 
equipment to read a single word into the desired memory address. 
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There is usually a set of switches associated with the input-output 
equipment. Although, as we have seen, the selection of input-output 
equipment can be made by the program itself, frequently provision is 
made on the control panel for the manual selection of external equip- 
ment. Other switches present a means for manually winding, rewinding, 
and positioning magnetic-tape units. More complicated external equip- 
ment such as cathode-ray or television displays of course require still 
additional controls. 

Let us now illustrate a possible procedure for initiating a computation 
by means of our control panel. First the input medium is properly 
set in the input device. Next the input device itself is turned on, and 
the external selector is set to choose this input equipment. An instruc- 
tion is inserted into address 000 that will read in the whole program. 
The accumulator, instruction register, and current-address counter are 
all cleared (why?). The mode is set at high speed, and then the start 
button is pushed. The computer will then execute the instruction found 
in 000 first, initiating the read-in of the program. 


3-8. Exercises 


3-1. By rearranging the order of one of the comparisons together with the order of 
some of the instructions in Table 3-2 above, one instruction can be eliminated—1.e., 
the code will take only 10 instructions instead of 11. Can you find this comparison 
and rearrange the instructions properly? (Hrvr: In address 0005 the instruction can 
be changed so as to jump directly to make W the largest number. But then some 
other instructions must be reordered.) 

3-2. Redefine in detail for a three-address system the add, subtract, multiply, and 
divide instructions given in Sec. 2-4. i ; ; } 

3-3. Do Exercises 2-32 to 2-37, using the three-address instructions defined in this 
chapter. Carry out each of the five steps of coding in every problem. ; ' 

3-4. Relate sets of instructions written for Exercise 3-3 to their respective boxes 0 
the flow diagrams. ; ; ; . 

3-5. Do Exercises 2-32 to 2-37, using the two-address instructions defined in this 
chapter. ; ‘ ’ 

3-6. Do Exercises 2-32 to 2-37, using the one-address instructions defined in this 
chapter. : : ‘ 

8-7. Define one-plus-one instructions corresponding to the one-address instroeaa 
given in Sec. 3-2; then redo the example of Table 3-8. Have a significant number 0 
instructions been saved? ; ; 

3-8. Write a code in three-address decimal-coded binary that will determine the 
largest of three numbers. 


In the following exercises, utilize the three-, two-, and one-address systems given in 
iar ae hea ee cade by Fig. 2-8. Code this in a three-address 
Been What would a vounten nesting. sanmustingy bai. a ane sigme yl 

ae Saat oe aprons Hee ush'ot 77770000 X 2° and 00007777 
2° (where, of course, the numbers are in octal), 
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3-13. Find the unnormalized difference of 0000 7777 < 2° and 7777 0000 X 25. 

8-14. For the numbers 7777 0000 X 2° and 0000 7777 X 2° find: 

The unnormalized major product, unrounded. 

The unnormalized major product, rounded. 

The normalized major product, unrounded. 

The normalized major product, rounded. 

The unnormalized minor product. 

The normalized minor product. 

8-15. What combination of other logical instructions would produce the same result 
‘im an extract instruction, in a binary computer? 

8-16. How can return-jump instructions facilitate the use of subroutines? 

8-17. Use the modify-index-register-and-tally instruction to code the problem of 
Table 2-15. Write the code in a two-address system. 

8-18. In precisely what sense are indirect addressing and augmented addressing the 
ume? 

8-19. Explain why for addressing “gain’’ we must have 7 2 r, Rk 2 m for indirect 
vddressing and r +i < R+i, R +7 2m for augmented addressing, where there 
ure 2" high-speed registers of contents R bits, 2" memory words, and 7 bits in the 
jistruction address. 

8-20. Code both the split-tree and jump-table method for a four-way branch, 
ising a one-address instruction system. 

8-21. Suppose that only a single on-off switch could be on at one time. Code a 
subroutine that will determine the position of a switch that is turned on in a one- 
iiddress system. 

8-22. Write a code for the same problem as Exercise 3-21, but now consider one- 
shot switches. 

8-23. Write an input subroutine, using the one-address input instruction given in 
this chapter. 

$-24. Using a one-address system, with an input instruction that reads in one word 
( « time as described in this chapter, write a read-in subroutine. 

8-25. Write a code for method 3 of the table look-up problem. Make use of an 
rvlracl instruction to extract the successive x; onto all zeros to compare with N; then 
wxtract f(N) onto zeros for the read-out. 

8-26. Code method 4 of the table look-up problem in a two-address system. 

8-27. Suppose that a characteristic ‘click”’ can be heard on the radio monitor each 
(ime the computer goes through a loop. Suppose also (for simplicity in this exercise) 
(hat any instruction takes precisely 100 psec to execute on a one-address computer. 
Write a code that will play the first few notes of the song “Dixie.” (Hirnvr: Look up 
(he nudio frequencies of the first few notes of “Dixie,’’ and code successive loops that 
will repeat at these frequencies.) 

$-28. Describe a procedure for finding out, by means of the control panel, what word 
is in & particular memory address. 

§-29. Suppose that a subroutine were necessary in order to read in a program. 
What would be a procedure for initiating a program by means of the control panel? 

§-80. If a computer needs a read-in subroutine, how is the read-in subroutine itself 
fond in (for the very first time)? (Hrnr: See, for example, chap. 9 in R. 8. Ledley, 
‘Digital Computer and Control Engineering,’” McGraw-Hill Book Company, Inc., 
Now York, 1960.) 

§-81. Write a code of one-address instructions for use in iterative processes, utiliz- 

jiu the flow chart of Fig, 9-6, assuming that f(X,,) is a subroutine. 
§-82. Write a code for Aitken’s method of Sec. 9-4, using two-address instructions. 
Assume that the code is to become a subroutine in a library of subroutines, 
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3-33. Write codes using a one-address instruction system for computing the 
derivatives of y = f(x), using the formulas of Sec. 9-7. 

3-34. Write a subroutine using a three-address instruction system for solving the 
differential equation 
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oy = f(x,y) 


using the first method given in Sec. 9-9. 

3-35. Write a code for solving Laplace’s equation, using a one-address instruction 
system and the method of iteration of difference equations of Sec. 9-10. 

3-36. Draw a flow chart for evaluating Laplace’s equation by the relaxation method 
given in Sec. 9-10. 

3-87. Write a three-address code for determining the prime implicants of a Boolean 
function, given the designation number of the function (see Sec. 10-7). 

3-38. Write a one-address code for determining the prime implicants directly by 
the method of Sec. 10-8. 

3-39. Write a one-address code for determining whether or not two Boolean fun 
tions are independent (see Sec. 10-10). 

3-40. Write a subroutine, using one-address instructions for Boolean matri 
multiplication. 

3-41. Write a subroutine, using one-address-format instructions that will solve 
Boolean equations.. Assume that the designation numbers of each side of the equa 
tions are given and that just the designation numbers of the solutions are to 
generated. 

3-42. Write a one-address program for the flow chart of Fig. 11-9. 

3-48. Write a three-address code to correct three errors, using the scheme ill 
trated in Fig. 11-15. 

3-44. Code the formulas given in Sec. 11-7 for evaluating the probability of fa 
drops in irredundant codifying, using a one-address instruction system. 

3-45. The Minimum Computer. It is clear that a general-purpose computer do 
not really “need” the multiplication or division instructions, for these operations ¢ 
be formed by a combination of additions or subtractions. Similarly other operatio 
can be omitted and replaced by simpler ones. The question arises: 
minimum instructions necessary for a general-purpose computer—or, indeed, wh 
criteria can be applied to determine whether or not a computer is actually a gene 
purpose computer? These questions have long been of interest to mathematici 
and logicians. The so-called Turing machine, or “universal computer,’’ was dev’ 
oped for this purpose (see A. M. Turing, On Computable Numbers, with an Applic 
tion to the Entscheidungs Problem, Proc. London Math. Soc., ser. 2, vol. 42, p 
230-265; a more readable description is given in Martin Davis’s “Computability 
Unsolvability,”’ chap. 1, McGraw-Hill Book Company, Inc., 1958). More recen 
S. P. Frankel has considered the minimum general-purpose computer; see his On t 
Minimum Logical Complexity Required for a General Purpose Computer, JRE Tra 
on Electronic Computers, vol. EC-7, no. 4, pp. 282-285, December, 1958. Frankel 
computer (called the Microcephal AC) can only subtract. Can you progr 
multiplication on Frankel’s minimum computer? 
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CHAPTER 4 


PROGRAMMING FOR SPECIAL-PURPOSE 
DIGITAL COMPUTERS 


4-1. Introduction 


To increase the effectiveness of digital computers in performing cer- 
tain special functions, it is often advisable to introduce special-purpose 
features into the design of the computer. These special-purpose feature 
will of course influence the programming methods required in the use of 
the computer. In this chapter we shall consider some of the problem 
involved in programming for such special-purpose computers. T 
method of presentation is to consider specially selected examples 0 
special-purpose computers, ranging from those which differ radicall 
from the general-purpose computer considered above to those built upo 
the extended general-purpose concepts. It is our hope that examples 0 
such computers—some having no formal instruction systems at all, so 
having no arithmetic functions, some having great deviations from t 
conventional instruction format, and some having multiple code-handlin 
capabilities—will present a broad view of the many possibilities existing 
in special-purpose computer programming. The examples discussed a 
not necessarily based on actually existing special-purpose computers bu 
have been formulated to illustrate a large variety of important concep 
in special-purpose computer programming. 

Qutline. We shall consider first the digital differential analyzer. T 
computer has no instruction system but rather consists of a collection 
computer units that are wired together as required by each particul 
problem. Next we shall consider a logical-control computer, using th 
control of a chemical factory as an example. Here a special ‘externa 
control” instruction is introduced to facilitate the logical computatio 
A special design for a searching computer will then be considered. Th 
special-purpose computer can be incorporated in a general-purpose ¢O 
puter as well. Business and logistics problems illustrate the need f 
special features enabling the handling of large arrays of numbers. He 
a radically different instruction format is illustrated, Tinally we ¢o 
sider the multiple-general-purpose computer, which is essentially 
interlocked collection of computers, used in such a manner as will ta 
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Kreatest advantage of the special-purpose components of which it is 
comprised. In our illustration this multiple-purpose computer will be 
comprised of the computer components discussed in the previous sections. 


4-2. The Digital Differential Analyzer 


Purpose. Frequently it is necessary to compute the value of some 
function point by point. For example, in the automatic-milling-machine 
application described in Chap. 1, the computer attached to the milling 
machine must guide the tool from point to point along a predetermined 
path that was given only grossly at the input, e.g., a linear interpolation 
between two points or a circle of a certain radius. Similarly in many 
other control operations rapid calculations of sin @ and cos 6 may be 
necessary, as in accurately sweeping an area with an antenna or in com- 
puting some functions. Not infrequently the familiar analog techniques 
vunnot produce a sufficient number of significant figures for the accuracy 
required. In such cases one can use a digital differential analyzer as a 
small, relatively simple and inexpensive special-purpose digital computer. 

The Concept of the Digital Differential Analyzer. The digital differ- 
ential analyzer computes successive values of a function by means of 
successive differential additions. Consider the problem of computing a 
inble of values for a function y = f(x). If we start with a given y(2o), 
then y(a1) = y(ao) + [y(x1) — y(ao)] = y(ao) + (Ay). Similarly we 
write y(av2) = y(a1) + (Ay)z,, and so forth. In other words, 


y (Xi) = y(ui) + (AYy)a; 


Now suppose that y = e*. Then dy = e* dx = y dz; or approximately 


(Ay)x, = y(t) (G41 — ai) = y(ai) Ax 
‘hus we can compute successive approximate values of e” by 
y (tins) = y(ui) + y(ai) Ax 


The smaller we make Az, the more accurate our results will be. How- 
wver, there is an obstacle here. Remember that multiplication produces 
i double-length result, so that, if Ax is small enough, the major part -of 
y(v,) Ma will be zero. Thus errors will result when the minor product 
which contains the significant figures of y(a;) Az, is dropped. It seouldl 
(herefore be better to work with the double-length extension of y(a;). 
Ilowever, this is not particularly desirable for what is supposed to be a 
wnall special-purpose computer. 

The use of a double-length accumulator can be avoided; to find out 
iow, let us examine our procedure more carefully. As we have noted, 


pach time we form y(x,) Ax the major product is zero; but certainly the 
Wajor part of y(a) must change eventually, 


This must occur during 
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the addition y(x:) + y(x,) Ax and will be the result of a carry from the 
minor part into the major part. In other words, as we accumulate in 
the minor part, we eventually propagate a carry, or overflow, from the 
minor part of y(x;) into the major part. If we are working in binary, the 
carry can only be a 1 and the major part of y(a,) can change at most by 1 - 
during any iteration. Hence we do not need a double-length accumulator 
at all—we need a single-length accumulator that simply accumulates suc- 
cessive minor parts of y(x;) Az, and a counter that holds the major part of 
y(a;) and adds 1 to the major part of y(«i) each time there is an overflow 
from the minor accumulator. We call the accumulated minor parts the 
residual. 

True multiplication is not essential in finding y(a;) Ax, for we can 
always take Az = 1/24, and we need only shift y(x;) by q positions to 
the right to form y(a;) Az. But since we do not have a major part to our 
accumulator, we can always choose Ax so that the shifting of y(x;) need 
not have to occur actually (although of course it occurs virtually). 

To illustrate these points, consider a 4-bit binary word and le 
Ax = .0001 (see Fig. 4-1). In Fig. 4-1a we have illustrated the accumu 
lator that is to hold the residual and the counter that is to hold the maj 
part of the function value y. The circle represents the componen! 
wherein y is virtually but not actually shifted; this is simply a gate that a 
the proper time passes y to be accumulated with the previous residual 
form the new residual. The dash-dot lines represent the “true” juxt 
position of the double-length value of y. For our illustration, start wi 
xo = Oand yo = e° = 1 preloaded into the counter. Then y(a%o) = 01. 
and y(ao) Ar = 00.00 0100, where 0100 is the residual. Since 


y(ar) = y(ao) + y(xo) Ax = 01.00 0100 


the 01.00 remains unchanged in the counter and 0100 is put into t 
accumulator (see Fig. 4-1b). With y(x:) Ax = 00.00 0100 again, 
have 0100 + 0100 = 1000 as the residual, and 01.00 remaining stil 
unchanged in the counter as y(x2). Then y(a3) Ax = 00.00 01 
whence 1000 + 0100 = 1100 is the new residual and 01.00 remai 
unchanged as y(z4) in the counter. Next y(xs) Ax = 00.00 0100, whene 
1100 + 0100 = carry 1 + 0000; now the counter is increased by 
putting 01.01 in the counter as y(x;) and leaving 0000 as the new residu 
We continue with y(zs) Ax = 00.00 0101, and so forth. The result 
each step and the graph of the function so calculated are shown 
Fig. 4-1b. 
Computing Units. 


The combination of accumulator, gate, and coun 
is called a computing unit of a digital differential analyzer, Many unit 
can be connected together to compute various functions. ‘The conte 
of the accumulator, the residual, is denoted by R; the contents of t 
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Carry steps the counter 
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sB000 01.00].000100].000100). --0100). 
rs 01.00].000100}.001000}. --1000].00---- 
0016 01.00}.000100].001100]. --1100).00 ---- 
0 01.00].000100].010000]. --0000).01 ---- 
100 01.01}.000101].000101). --0101].00---- 
0101 01.01|.000101].001010}. --1010].00---- 
.0110 01.01].000101],001111). --1111|.00---- 
0111 01.01].000101].010100}. --0100|.01---- 
1000 01.10].000110].001010}. --1010].00---- 
1001 01.10).000110].010000}. --0000}.01---- 
1010 01.11].000111].000111}. --0111].00---- 
1011 01.11].000111).001110]. --1110].00---- 
1100 01.11}.000111].010101). --0101].01-- 
1101 10.00|.001000}.001101}. --1101].00-- 
1110 10.00].001000].010101). --0101].01-- 
1111 10.01].001001].001110}. --1110].00-- 
1.0000 10.01].001001].010001]. --0001].01---- 
1.0001 10.10}.001010].001011]. --1011].00---- 
1.0010 10.10].001010].010101}. --0101].01---- 
1.0011 10.11}.001011].010000}. --0000].01---- 
1.0100 11.00].001100/.001100]. --1100].00---- 
1.0101 11.00}.001100}.011000). --1000].01---- 
1.0110 11.01].001101],.010101}.--0101). 
1.0111 11.10/.001110].010011].--0011]. 
1.1000 11.11].001111|.010010}.--0010]. 








Fia, 4-1. Concept of the digital differential analyzer. 
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counter, the functional value, is denoted by y; the overflow, or carry, 
from the accumulator that steps the counter, Le., the differential value 
that is to be added to some functional value, is denoted by Ay (see Fig. 
4-2). The gate may cause either addition or subtraction of the contents 
of the counter (virtually but not actually shifted) to the accumulator. 
As an example of how units can be com- 

Accumulator bined, consider now the problem of comput- 

ing sin x and cos x. Let 


yi = COS x and yo = sin x 
Then the differential equations are 


dy, = — sinada = —y2, dx 
and dy, = cosxdx = yi dx 


Thus we have 





yr(tig1) = yrsri) + AY Ay, = —Y2 Ae 
Fic. 4-2. Computing unit of and 
digital differential analyzer. y2(tin1) = yo(ai) + Aye Ayo = yi Ax 


Hence the arrangement shown in Fig. 4-3 will compute y1 and y2. 
Programming. There are three steps to programming a proble 
First, the proper differential equations must be worked out so that A 
appears in the form y; Av. Second, the connections between units m 
be diagramed. And finally the scaling must be worked out. Consid 











y, = COS x 


Fig, 4-3. Combinations of digital-differential-analyzer computing units to compl 
y, = cos a and y: = sin 2. 


this latter step first. Note that, since the cownter can be increased only 
1 at a time, then during any iteration y can increase only by 1. In ot 
words, the graph of the function cannot be steeper than 45°. ‘Thus in 

example of Fig. 4-1 we could not compute e* for any greater values of 
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Ilowever, the slope of any f i j 
y function can be adjusted by scali i 
does not exceed 45° within the range of computation. Hee 
CORSE as another example the problem of computing the product 
Yi = yry2. Since dys = y; dy2 + ys dy:, we have 


Ys(i41) = ys(ai) + Ays Ays = yi Ayo + y2 Ay 
Yo(ti41) = Yo(ai) + Aye 
and YilXiga) = yi(ai) + Ayn 


Hence the configuration of Fig. 4-4 will compute y; = yi¥2. Note that 





2x Ax 2Ax 





Fig. 4-5. Computing y = 2”. 


the counter for ys has two inputs and that these must be electronicall 

arranged so that they do not step the counter at precisely the same fae 
Note also that not all of the computing unit for y3 is used i aeties 
lonture of this arrangement is that the inputs to the gates are nos i 1e 
Av, but rather Ay; and Ay, themselves. ik 


Next consider the computation of y = 2%. Here dy = 2x dx and 


Uti = yi) + Ay, Ay = 2x, dx; Fig. 4-5 shows the arrangement. 


iT I ne te that one ¢ i 
ere te Ut y -] of the count rs 18 loade 
; e d Ww ith the constant 2 and 


linally consider the computation of y = 1/2, that is, division, Here 
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dy = (—1/z?) dz; letting w = —1/2?, we find dy = w dx and 


[Cuap. 4 


2 2 1 
Thus y(aix1) = y(ai) + Ay Ay = w Ax 
and w(aiz1) = wai) + Aw Aw = —2y Ay 


The arrangement of units for this computation is left as an exercise. 


4-3. Real-time Logical Systems Control: A Real-time Control Compute: 


An Example. Very often the primary function of an on-line digit 
control system (i.e., a system connected to a dynamic process) is to ma 
sequences of logical decisions that depend on the state of certain variabl 
associated with the world external to the computer. Here we shall co 
sider the special-purpose real-time logical-control computer and sho 
how it might be applied in a specific example of controlling a chemi¢ 
factory. In discussing first the specific application, the reasons for t 
various aspects of the control computer will be better appreciat 
Although the example is concerned with the control of a chemical facto 
performing a specific chemical process, the situation is conceptual 
similar in almost all such real-time control problems as may occur 
other types of automatic factory control, in tactical gunnery contr 
or in distributing tracking data to command centers according to p 
determined strategies. The example also contains a feedback loop 
which the results of the input digital disposition influence the outsi 
world, which in turn again influences the input to the computer. 

Figure 1-9 on page 16 shows the flow chart of a chemical facto 
manufacturing Al,O;. Briefly, clay is mixed with water and a deterge 
to make a wash containing the raw material. A certain amount of t 
wash is tapped off into an evaporation tank, where it is concentrated 
evaporation. Then in another tank HCl gas is mixed with the wash un 
it is saturated. The wash is centrifuged and a precipitate of AIC];6H 
is obtained. The precipitate is decomposed, by heating, into gase 
HCl, which is reused, and into solid Al,O3, the desired end produ 
It is of course understood that, even though this problem is based on 
actual process, it has here been grossly oversimplified and actu 
stylized. It is not meant to be examined from a chemical-enginee 
point of view, since many details of both the necessary conditions 
desired results are but casually indicated. 

The state of the chemical factory is given by 16 signals, listed in Ta 
4-1. The computer periodically senses these signals, generating f 
them 12 control signals that direct the future state of the factory. 
future state of the factory results in changes of the input signals, whe 
new control signals are generated by the computer, and so forth, Ta 


signals. 


signals are on and which are off. 


lable 4-3; in the table under each bit of the word is shown the state of the 
forresponding signal. We can similarly use a word for each of the sig- 
ilicant sensed-signal conditions of Table 4-2, so that the conditions can 


lw matched with the state of the sensed signals at any time. For exam- 
ple, the first condition, L(1)-€-T(C) - - - 
tc 
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|-2 gives the relation between the sensed-signal conditions and the neces- 
wary control signals to be generated. A symbolic notationzl form for each 
al a is also given, where - means and and — means not (i.e., no 
aignal), ay 
Programming Logical Control. The computer will have one subroutine 
lor cach set of the control signals to be generated. Hence a jump table is 


TABLE 4-1 SIGNAL SHow N THE STATE OF TH Cc EMICAL Facto Y 
i. S ING E H IC TOR 


Signals generated by factory Symbolism 


Level of L(1) reached.................... L(1) 
Concentration test good.................. Cc 
Time for concentration test............... T(C) 


Valve 4 open 


eaveissousrira heals nein ates cfatatoaseany V(4) 
Level of L(2) reached.................... L(2) 
Tank 2 empty........... 0.00. .0 eee ee eee E(2) 
Tanke: 3: full vice eacecd once ea hott Sakae F(3) 
Temperature T'(1)............. 0.0.0.0. T(1) 
Temperature 7'(2)....................... T(Q) 
Saturation test positive.................. Ss 
Tank 3 empty........... 0.00.0 cece eee E(3) 
Valve 7 open........0. 0 cece eee eee ees V(7) 
Centrifuging finished.................... C(F) 
Centrifuge empty................0...00. E(C) 
Pressure of HCl too high................. P 
Enough volume.............0. 000.000.0505 WwW 

wquired to initiate the proper subroutine for a given state of the sensed 


However, we need an input to the jump table that tells which 


jump instruction to use, and we shall first describe how this input may 
hw accomplished. 


At any instant the bits in the signal-sensing word (Fig. 1-9) tell which 
For instance, the word might be as in 


, would be represented by 
But what about the rest of the bits? Condition L(1)-€-T(C) 


(loon not mention the rest of the signals because for that condition the 
fiuke no difference. 


They are to be ignored. Thus we need another 


word that tells what positions are to be considered and what positions 
ie to be ignored. The second word will have units corresponding to bits 
Phat are to be considered and zeros corresponding to bits to be ignored 
Vor example, L(1)-€-T(C) is denoted by the following two words: 


1lOle wear avex were 
1110 0000 0000 0000 
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where the x means that it 
unit. 


1st word: 
2d word: 


There would then be two words representing each of the 10 conditions of 


Table 4-2. 


TaBLE 4-2. ConTROL SIGNALS 


Sensed-signal conditions 


0001 1000 
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makes no difference whether the bit is zero Or 
Similarly the fourth condition, V(4)-L(2)-E(3), 


gaxv0 lave axalx xaxrre 
0010 0000 


Symbolic 
representation 
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is denoted by 


Corresponding signals 
to be generated 











1. Level is at L(1), and con- 


centration test is bad, and 
time for testing is due 


. Level is at L(1), and con- 
centration test is good, 
and time for testing is due, 
and tank 2 is empty 

_ Level L(1) has not been 
reached, and valve 4 is not 
open 

. Valve 4 is not open, and 
level is L(2), and tank 3 is 
empty 

. Temperature is 7'(1), and 
tank 3 is full, and valve 7 
is off 

. Temperature is 7'(2), and 
tank 3 is full, and valve 7 
is off 

. Saturation test is good, 
and valve 7 is closed, and 
centrifuge is empty 


. Centrifuge finished 


. Weight is enough......- 


ee 


TasLe 4-3. EXAMPLE OF SIGNAL-SENSING Worv 


LO “ve tsowr ye 


(1) 0+ (TC): V4)» £02) + BC) + FCB) * TCA) * 1(2) 8+ £(8) » 0(7) CCF) BCC) P 


. Pressure of HCl high... . 


L(1) -€-T(C) 


LQ) + C+T(C) - E(2) 


L(1) - (A) 


V (4) - L(2) - EQ) 
T(1) -F(3) - V(7) 
T (2) -F(3) - V(7) 


S-V(7)- E(C) 


C(F) 


P 
|W 





0 1 p! 0 


Throw away solution by open- 
ing valve 3, and add clay, 
detergent, and water in tank 
1 by opening hatch 1, valves 
1 and 2 { 

Tap off certain amount of fluid 
into tank 2 by means of valve 
4 





Add detergent, water to tank 1 
by opening valves 1 and 2 


Transfer fluid to tank 3 by 
opening valve 5 


Cut rate of HCI flow to slow by 
closing valve 6 


Increase rate of HCl flow to 
fast by opening valve 6 


Close flow of HCl by means of 
valve 6, transfer fluid to 
centrifuge by valve 7, a 
start centrifuge 

Put precipitate in furnace b 
opening hatch 2, throw awi 
wash by opening valve 8 

Lower heat of furnace 

Take out resultant AleOs 
opening hatch 3 





0 0 0 1 0 Tg 
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Figure 4-6 is a flow chart of a real-time operating program that will 
compare each of the conditions with the sensed-signal word until a favor- 
uble comparison is obtained; then, depending on which condition com- 
pared favorably, a jump will be made to the proper signal-generating 





















Call in new 
signal-sensing word 


Set i=1; 
set jump tally N=1 


Using 2d word of condition i 
as a mask: 


(a) Extract 1st word of condition i 
into condition temporary 


(6) Extract signal word into signal 
temporary 





Compare if (condition temp.) = (signal temp.) 


Unfavorable 
comparison 


Favorable 
comparison 


Jump to 
subroutine N 


Generate 
signals of 
subroutine N 











Add 1 to 
jump tally NV 


Fic. 4-6. Flow chart of real-time operating program. 


wibroutine. After completing the subroutine, a new sensed-signal word 


will be called into the high-speed memory, and the process will start 
over. If no condition compared favorably, the program would call in a 
ww sensed-signal word and repeat the comparisons, and so forth. 
Description of a Special-purpose Logical-control Computer. When 
\indreds of signals must be sensed, as would occur in a more realistic 
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case, it may be more efficient to have the two extractions and comparison 
made in one step by a special computer instruction, which we shall call an 
external-control instruction. Such an instruction would have three 
inputs: the sensed-signal word and the first and second condition words. 
If the comparison is unfavorable, the instruction would automaticall 
inerease a relative counter. This instruction would repeat itself as th 
next instruction. The addresses of the two condition words are take 
relative to this counter so that they are automatically modified appropri 
ately for the next condition when the external-control instruction repeats 













Wired control - 
signal generator 





Signal-sensing 


External 
control 
instruction 












arithmetic 
unit 


Fic. 4-7. Real-time logical-control computer. 


itself. If the comparison is favorable, the next instruction is taki 
from the address of the external-control instruction plus the coun 
reading; i.e., the external-control instruction would be followed by t 
jump table. The method by which the comparisons are repeated is | 
as an exercise for the student. 

For a permanent installation the subroutines generating the cont 
signals can be wired in. Hence the real-time logical-control compu 
need contain little more than the external-control instruction, an instr 
tion to clear the relative counter, a comparison instruction, a ju 
instruction, and an add instruction (see Fig. 4-7). 

Since the program will not often be changed, it can be inserted in 
the computer by means of the control panel; therefore no input-out 
equipment is necessary except the signal-sensing word and the wi 
control-signal generator. The memory of such a special-purpose C0 
puter need be only a little larger than necessary to accommodate 
condition words. The speed of the memory need only be fast comp 
with the feedback reaction time of the system it is controlling. lor 
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chemical factory here described this need not be particularly fast; for a 
(racking system it must be very high speed. 


4-4. Special-purpose Information-retrieval Computer 


An Example. The occasion may frequently arise where a special- 


purpose searching, sorting, and ordering computer becomes feasible 
Asa specific example of the use of such a computer, which will ‘fdicata 
the motivation for its design, consider a contadcuunting system in 
factory. Each workman in the factory is given a job; he ori 
wupphes and parts; he spends time using the supplies and aesateblin tie 
parts to produce some final object; then he may be RES j b 
(which may be a repetition of the first), ete. The executives and acc 
its of the factory would like to know how many objects of ihe ‘ 
have been completed, how many supplies and how many parts were He 
during a particular period of time, how many hours a particular workman 
apent on a particular job, how many man-hours altogether were spent i 
taking a certain number of these objects, and so forth. The issgn sen 
fiven here of a cost-accounting system is necessarily pera lified, but 
the basic principles and concepts are the same. : 
Suppose that each procurement of supplies and parts, each object 
ompleted, the time consumed in each job, etc., were recorded b ae 
iuimber, workman’s name, job number, date, and so forth, at the renee 
fwlion was taken or the job completed; suppose that all this information 
for | week could be compiled on a single magnetic tape. The problem 
(hen reduces to searching, sorting, and ordering the items on the tape to 
‘iiewer the above questions. In general a computer that can seouh 
(hone operations need be capable only of comparing, counting, and 
alate Whether or not such a Bpedialswurnoss a is 
unible de ; j i i 
_ see Haosebn bis cost and on the time it takes to process the neces- 
A Small Retrieval Computer. Let us suppose that any item can be 
feoorded in 50 characters or less (a character being 6 bits). Then the 
Homputer will first read a set of 50 characters into the cemmnerany storage 
i Honending loop of tape (see Fig. 4-8). Selected characters will be oe 


jintely compared with characters from the previously loaded argument, 
slorage, a second loop of tape. Depending on the outcome of this com- 
jiivinon, the computer may then add into a partial sum in the accumulator 
ine number represented by certain of the 50 characters, or may increase 
the count of the counter, and so forth. Actually several sets of data 
tiiy be recorded in the temporary storage unit. When writing back onto 


tie magnetic tape, the computer can transpose sets if it is ordering the 


formation, o i i i 
, or can insert new information, and so forth. In summary, 
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successive items from the magnetic tape are stored temporarily in the: 
computer; here information is extracted from them, or they are otherwise 
processed; then they are replaced on the magnetic tape, perhaps in some 


Magnetic-tape unit 


Temporary 
storage unit Argument 


storage unit 





Simple 
arithmetic 
unit 


DCI 
= 


| Counter | 























Fic. 4-8. Small retrieval computer. 


transposed order. The processing is done in conjunction with info 
mation preloaded on the argument storage unit. 

The number of instructions to be written into a program will b 
limited to the number of instrue 
tions that can be executed from th 
time an item is read from the tape 
until it is put back again. The in 
struction system can be very simpl 
Suppose that the instructions we 
punched on cards and the cards fo 
one program put into slots in th 
control unit, where the holes @ 
the cards can be sensed. The af 
guments on the argument-storag 
unit tape loop, which are fixe 
during any program, can also 
addressed. The instruction format could be as shown in Fig. 4 
Each instruction is concerned with selected characters, as recore 
of the current item in the temporary storage unit, 


Instruction card 


Selected characters 


Operation 
2d argument 6 
Next instruction 
Next instruction 6 


3 
~ 
i 
o 
(= 
3 
a 
= 
6 
~ 
a 
e- 





Fic. 4-9. Instruction format. 


on the cards, 
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instruction may involve two of the arguments of the argument stor- 
ge unit if, for example, a search for characteristics less than a but greater 
than B is desired. A two-way exit, y or 6 depending on the outcome of 
(he comparisons, is allowed for flexibility in coding. Consider as an 
illustration the computation of the number of hours in a week that work- 
man 1224 required to make part 2746 and the number of parts he com- 
pleted in that week. Suppose that 40 were the code for an item recording 
finished job. The instructions would compare the properly selected 
characters of each item with 1224, and 2746, and 40. When an “equal”’ 
(omparison was made, the computer would add 1 to the counter and add 
(he number of hours recorded on this item to the accumulator. If an 
equal comparison were not made, the computer would just replace the 
liem on the tape. By this means the total number of man-hours worked 
in making a particular object during any specified time period can be 
determined in one pass through the tape. Similarly the number of man- 
hours that were worked by a certain person, or in a certain shop, etc., or 
(he number of a certain part that were consumed, or the amount of a 
wortain material that was used, in any shop or by any person, in making 
\ particular object, during a specified time, can be determined in a single 
puss, and so forth. Suppose that the magnetic tape runs at a rate of 
100,000 bits/sec, i.e., about 17,000 characters/sec (6 bits = 1 character), 
or 340 items/see (50 characters = 1 item), or 20,000 items/min. Sup- 
poxe also that there are 500 workmen in the factory, each generating an 
average of 5 items/day, or a total of 2,500 items/day, or 50,000 items/ 
month (20 working days = 1 month). One pass through the tape con- 
jaining 1 month’s items would take approximately 2 min and 30 sec. 
Comparison with more conventional bookkeeping methods is hardly 
jecessary. On the other hand, if we were to run a standard payroll calcu- 
\ution on the man-hours worked for each of the 500 men, it would take 
(500 men)(2 min 30 sec) & 20 hr. Also, we have not mentioned the 
input and output equipment problem, which in the case of payroll, for 
lnutance, is a subject in itself. 

Integration with a General-purpose Computer. In order to increase the 
working speed of this type of computer, we can enlarge the argument 
slorage unit and include more accumulators and counters to accommodate 
nore than one comparison, so that in one pass several pieces of informa- 
\ion can be accumulated; or we can run several tapes in parallel, and so 
forth, The time per information pass can thus be decreased by a factor 
of 10 or 100, that is, to within 15, or even 1.5, sec. This would increase the 
wmplexity and hence the cost of the computer. Finally we can attach 
Hur special-purpose searching, sorting, and ordering computer to a gen- 
eral-purpose computer, enabling more complicated processing to be 
wocomplished as well as more automatic operation (see Fig. 4-10). In 
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such a case the tapes can be searched at the same time as the general- 
purpose computer computes other things. The fixed card memory 0 
the special-purpose computer will be replaced by a part of the high-speed 
memory shared with the general-purpose computer. Two codes will be 
written, one for the computing unit of the general-purpose computer, 
the other for the simple arithmetic unit of the special-purpose computer 


General-purpose 
digital 


Concurrent 
searching, sorting, 
and ordering unit 


Buffer 


interlock computer 





Fra. 4-10. Special-purpose computer attached to general-purpose computer. 


An interlocking buffer must be designed so that the general-purpo 
computer will stop and wait if it needs some information not yet retrieve 
from the special-purpose auxiliary computer, and so that the searchi 

computer will stop and wait if it in turn needs information not yet gem 
erated by the general-purpose computer. An auxiliary computer for th 
special purpose can be exceedingly effective and timesaving when lar 
amounts of this kind of nonnumerical manipulative computing @ 
necessary. 


4-5. Manipulations with Rectangular Arrays: A Business and 
Logistics Computer} 


The Need. An important class of computations occurring frequentl 
in business and in logistics (i.e., processes involved in supply) is co 
cerned with rectangular, matrixlike arrays of variables. The operatio 
to be performed on the variables are usually very simple, the main pro 
lem being the manipulations necessary for handling such large-se 
arrays. In this section we shall first indicate some applications of the 
techniques; we shall then show how a few relatively simple speci 
purpose instructions can significantly aid these processes; and finally 
shall discuss a special-purpose computer especially designed to perfo 
manipulations with rectangular arrays. 

As a first example, consider the keeping and updating of a spare-par 
inventory. Suppose for simplicity that there are 10 parts under ¢o 
sideration and that the first row of the array in Table 4-4 represents t 
desired inventory for each of the 10 parts—i.e., there should be 100 each 
the parts in stock. The second row represents the actual stock level 


+ The main concepts discussed in this section are based on the original research 
Dr. W. H. Marlow, principal investigator of The George Washington University Na 
Logistics Research Project, 
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cach of the parts at time ¢. Now let us suppose that the parts are con- 
sumed in the course of three kinds of overhauls and that the number of 
parts used for one of each kind of overhaul appears on lines 4, 5, and 6, 
respectively. Now suppose that since time ¢ there were one overhaul of 
kind 1, two overhauls of kind 2, and three overhauls of kind 3. The 
problem is to determine the present stock levels, i.e., to adjust row 2, and 
to determine how many of each part should be ordered in order to have 
(he desired number of spare parts on hand, i.e., to determine row 3. To 
(uke account in the inventory levels of the parts used for the overhaul 
of kind 1, subtract row 4 from row 2, column by column; for the two 


TaBLe 4-4, Dara ARRAY FOR INVENTORY-UPDATING PROBLEM 





Part] Part| Part) Part) Part) Part) Part] Part) Part) Part 
1 2 3 4 5 6 7 8 9 | 10 
ltow 1, desired inventory........ 100} 100 | 100) 100 | 100} 100) 100 | 100} 100) 100 
Row 2, stock on hand (time t)....| 80] 75} 90} 80) 95) 90| 85] 95] 70} 75 
ltow 3, parts to order........... 
tow 4, overhaul kind 1, parts 








MSOd.. sigcns <cdeetad she 26 end ne dats 5} 15] 5] 10 5 0; oO; 5} 10 5 
tow 5, overhaul kind 2, parts 
WSO. fcc ieettepice Hacdeeetene wearers 10; O} Of} 10; 10} 20} 15) 5] 10} O 
































overhauls of kind 2 multiply each figure in row 5 by 2, and subtract the 
result from row 2, column by column; and similarly form 


Row 2 — 8 X (row 6) 
We thus obtain the present stock levels in row 2: 
40 60 55 20 55 20 10 20 40 55 


Mubtracting row 2 from row 1 and putting the results in row 3, column 
hy column, we obtain the number of parts to order of each type. Of 
“ourse many other related problems can be similarly solved; e.g., we may 
hecord the usage of parts in each time period and then extrapolate column 
hy column to predict usages in the future, and so forth. We have used 
only 10 different parts, but in realistic cases thousands of parts may be 
hivolved. 


\nother example is the calculation of building-material purchase 
whedules and requirements as functions of time, as would be needed in 
the construction of a ship or of a large office building. An array can be 


tiade of the materials needed at the construction site as a function of 
That is, successive rows of the array will represent successive 


ee 
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times, and the columns of the array will represent the various building 
materials; then an element in the array will indicate how much of a 
certain building material is needed at the building site at a particular 
time. However, there is a certain time lag that must be allowed for the 
delivery of each of the materials from the time it is ordered. This time 
lag differs for different materials and can be represented as a row of the 
array. The problem here is to schedule the orders so that the materials 
will arrive at the site at the proper time. In addition the costs of the 
materials per unit can be represented in another row. Tf the materials are 
paid for at the time they are received, then a calculation should be made 
as to how much money will be spent during each time period. This 
calculation can be made after the ordering schedules are set by multiply- 
ing the number of units to be received in a time period (i.e., a row) b 
the cost of each per unit and summing over the elements of the resultin 
row, i.e., adding the columns of that row independently. 

The keeping of bank accounts presents similar array-manipulatio 
problems. Here suppose that each row contains the balance of each 
the savings accounts for successive weeks and that interest is compute: 
monthly, based on the minimum weekly balance. Here the minimu 
of the rows each month must be obtained and the interest rate applied t 
each account (i.e., each column of the minimum record row). If th 
bank desired to know the total deposits each week, the row for each wee 
is totaled. In this way calculations of interest, principal, penaltie 
interest rates, and other banking problems can be made. 

Military and nonmilitary dynamic-operations simulation studies ca 
involve these processes. For example, consider air-traffic control (i 
three dimensions). ‘Three rows can record the x and y coordinates an 
altitudes, respectively, of the aircraft represented by the successi 
columns. If the x and y velocity components are known, the coordina 
of the aircraft can be predicted in time and aircraft that may be flyi 
too close appropriately warned. When changes in course are ma 
recalculations of predicted positions of all the aircraft become easy. 

Of course, we could describe above only the simplest of applicatio 
Our intention is to stimulate the reader’s awareness of the great varie 
of fields in which problems arise that involve the manipulations of la 
arrays. Althougha general-purpose computer can handle such proble 
special-purpose computers can fulfill the need much more effectively a 
efficiently. We shall now describe some special-purpose operations t 
would be incorporated into such a computer. 

Basic Operations and I nstruction Format. With the above examples 
mind the reasons for the definitions of the following basic manipulat 
operations are clear. In fact it is quite surprising that these few sim 
operations can accomplish the task so well. The instructions that W 
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shall illustrate contain the necessary ingredients, although their form in 
actual practice may vary greatly. It is felt that a concrete example may 
best serve to demonstrate the concepts. 

Since we are concerned primarily with arrays of numbers, ie., with 
numbers arranged in rows and columns, suppose that we literally arrange 
these numbers in such an array in the computer’s memory. Figure 4-11 






Column 
pen 2 123 4 5... 
ow SssssgsSsSsSsssSarT 
Vfl. fo ff fl ffl Jf 
Row 2 
Biss ET CY SG EE 





Fia. 4-11. Array of numbers stored on drum memory. 


represents such an arrangement on a drum memory. Then in order to 
“address” a particular number, all we need specify is its row and column. 
‘or reasons that will presently appear clear, we denote the number in 
ow j and column z by the following functional notation: f;(7)—see, for 
example, Table 4-5. 

The address of f;(z) is thus specified by the pair of indices: j, x. We 
me the phrase generalized address because the instruction actually indi- 
utes a collection of addresses in the manner now to be described. Con- 
sider a multiplication instruction. Rather than form the product of 


Tasie 4-5. NoraTioN oF THE ARRAY 


Column 2 
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just two numbers, f,(x) ° f;(a), we wish now, with a single instruction, 
to form a whole row of successive column-by-column products, that is, 
fl) +f), and fi(2) - f(2), and f(3) - f;(3), and so forth. We might 
represent this as the operation fa) - f(z) for « = 1, 2, 3,..-- To 
add a little more flexibility to the operation, we might form, with a single 
instruction, f.(x) - fi(x) forp Se <q, that is, the successive products for 
only the columns p through g. For example, if i = 1,7 = 2, p = 5, and 
q = 8 we would form (see Table 4-6) 


fi(5) ‘ f2(5) f1(6) . f2(6) filZ) - fo(7) 


Thus the multiplication instruction must now contain the row indices 
i and j and the column indices p and q; this collection of indices repre- 
sents the so-called “generalized addresses.”” Suppose that we wish the 
results to be placed in row k of the corresponding columns p to q; then the 
index k must also appear in the instruction. Thus i, 7, and k, together 
with p and q, represent the generalized a, 8, 7, and 6 addresses, and all 
these indices must appear in the instruction format itself. 


fi(8) + f2(8) 


TaBLe 4-6, ILLUSTRATION OF OPERATION fila) -fo(z), 5S ax <8 


CO Re see ee 
Column 











Row 1...... 9 7 8 6 1 5 2 4 3 1 
Row 2...... 6 2 5 1 0 3 4 8 7 2 
Row 3...... Shes, ‘gate, Jaton nay Tacvee 15 8 32 21... 
en a SS 


A description of the fourth generalized address, that of the next 
instruction, requires reconsideration of the drum memory. An instruce= 
tion would appear as a partial row of numbers and can be recorded along 
with the data on the drum. For greatest efficiency a row length on the 
drum should be a multiple of an instruction length. To address the 
part of a row that is an instruction, only the row and column of the first 
number of the instruction need be known, since the length of the instruc 
tion is fixed by the instruction format (see below). Let us denote the 
row and column number of the first characteristics of an instruction by 
§, and 6,, respectively. These must also appear in each instruction word, 

There are several other considerations that can lead to greater flexi« 
bility in our instruction format. F irst suppose that we desired to for 
C+ filx) f(x) for pS x <Q), where C is a constant. Since this 
frequently desired (e.g., interest, and discount rates), it is convenient t 
include the constant C in the instruction format itself, rather th 





























Operation 
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address it separately. Second, observe that so far we can work on 
the different rows within the same column but we cannot have a number 
in one column affect a number in another column. Thus, for example 
the product fi(x) : f;(z) becomes fi,(x) in the same column. To eicaine 
vent such a restriction, it suffices to introduce into the instruction format 
(he alternative of placing the result of the operation in corresponding 
columns in the kth row, or of placing the result one column to the left in 
the kth row, t.e., displaced one column. That is, one may choose to have 
the product f.(x) « f(x) become fi(x) or f(a — 1). Finally, recall that 
(y) is usually replaced by the result of the operation. However, great 
{lexibility results if there is introduced into the instruction the ditemnatis 
of cither replacing (y) or else accumulating (adding) the results of the opera- 
tion to the original contents of y. As an example, suppose that, given the 
initial rows 1, 2, and 3 of Table 4-7, we formed fi(z) - fo(x), for5 < x < 8 
(that is, 2 = 1,7 = 2, p = 5, q = 8), and accumulated the result in row 3 
displaced. Then we would have 


fs(a — 1) final = fila) - fo(~) + f3(a — 1) initial 
for 5 <x < 8 (see Table 4-7). 


TasBLe 4-7, ILLUSTRATION OF OPERATION fi(x) - f2(x), 5 < & < 8, PLacEpD 
iN Row 3 Disptacep By ONE CoLuMmN, AND So as TO ACCUMULATE 


Column 





0 1 2 3 4 5 6 7 8 9 








Row l.......... 9 7 8 6 1 5 2 4 3 #1 
Row 2.......... 6 2 5 1 0 38 4 8 7 2 
Row 3, initial... . 3 2 4 6 2 1 5 3 2 6 
Row 3, final... .. 3 2 4 6 17 9 387 24 2 6 





With these considerations in mind we can represent an instruction 
format as 












Displaced | Accumulated C 


Row j | Rowk | p | ¢ t 
or no 


code 





We shall use the convention that 1 means displace, 0 means do not, and 
(hat | means acewmulate, 0 means do not, in their respective columns. 
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The instruction that will produce the final result shown in Table 4-7 is 






Operation | 7 | Jj 







































Multiply | 1 | 2 


We are now ready to define seven basic operations, where the column 
by-column disposition of each type of result is understood to be in accor 
with the above discussion, and where only the column-by-column ope 
ations on the arguments need be given. 
1. Multiplication 
Cfi(x) f(z) pees 

2. Division 

C(x) 

Si(x) 





3 
lA 
8 
lA 
SS 


3. Linear combination 


Cfi(z) + fiz) — P 


min[fi(z), fi(e)] prs 


lA 
a 
A 

a 


4. Minimum 


The final three instructions require some additional explanation: 
5. Displaced linear combination Cfi(x — 1) + fi(x). Here the ad 
tion is not performed column by column, but rather displaced column 
column. Of course, it is always understood that p < x <q, but the 
when x = p and p= 0, the meaning of f(O — 1) is undetermin 
We therefore define f;(0 — 1) = f,(0). 


6. Partial scalar product y C- fiu)- f(u). This defines one result fi 
H=P ; 
each x such that p <x <q. For example, if row 7 and row 7 were 











Column 
012 3 4 5 
Row 7......... 765 4 3 2 
Row j.......-- 23 4 5 6 7 








and p = 2 and q = 5, then the results would be 
5 X 4 = 20 5x4+4xX5=40 5X4+4K%5+3X6™ 
and 5K4+4XKX54+38K64+2X7 = 72 
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7. Compare ( min [f;(x)]):C. This operation does not yield a column- 
PES 


hy-column result ; rather it is a two-way exit. If the minimum number in 
row 7, columns p to q, is greater than C, then the next instruction is to be 
{uken as the instruction that has row 7, column k as the address of its 


first number; otherwise, i.c., if min [f;(x)] < C, the next instruction is 
px <q 


\o be chosen as usual, from 6,, 6... Of course displace and accumulate have 
0 meaning for a compare instruction. 

[Illustrations of Coding. As a first illustration of using these special- 
jirpose instructions in writing codes, consider the problem of accumu- 
lating the amounts within each column, from row 7) to row 7p + A, putting 


Tani 4-8. Copzt ro ACCUMULATE THE Rows 7p TO to + h, CotuMN BY CoLumMNt 


. . Dis- & 

tion Opera: t j k P|@ ces Accu c br | 5s Remarks 
tion place | mulate 
address 












































0,0 3 to | to +1 {t+ 110/21 0 0 di 1|0] Add ¢ to i + 1; put 
ini+1 
1,0 7 0 3 Oo /ili tot h—1)/2/Ofisio+nr—1 
> : Stop iteration 
| < : Continue 
8,0 3 4 0 0 1/3 (0) 0 1 0| 0} Update (0,0) 
0 Stop computer 
4,0 0 1 1 1 0/0 0 0 0 0| 0] Constant 











oh Se tl 


{ Heonll that the columns are numbered 0 through 21. 


the results into row 7) + h. Suppose for simplicity that the drum has 


¥! columns (just twice the instruction length) and that the code is 
wrillen in addresses 0, 0; 1, 0; 2,0;ete. The code will make use of oper- 
alion 8, with C = 1, not accumulating the results of summing 7 and 
‘+ 1. (Why?) Then a compare instruction will occur for the tally, 
aud (he iterations will proceed by adding 1 to the generalized addresses 
i), und & of the first instruction until the accumulation is completed. 
Mave the first instruction is updated on each iteration, its 7 address can be 


ted us the tally. The code appears in Table 4-8. 


Aw w second example, consider a linear extrapolation. Suppose that 


fach column represented an aircraft and that rows 21, y1, and ¢; repre- 
wilted the aw and y coordinates of each aircraft at time t:, and rows 22, 


» wad ty represented the « and y coordinates of each aircraft at time fy. 
‘ww problem is to make a linear extrapolation of the coordinates of each 
June at time t. The formulas to be used are 


e=e(t—t) +a, and 


y= (t—t) 2-4 4+ 4, 
ly oe 
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We first form x2 — 21 in row 21, y2 — yi Nn row y, be — bh in row f1, and 
finally t — t2 in row t. Then we form the quotient of rows x; and é; in 
row 2; and of rows yi and t; in row y1. Next we form products of rows t 
and 2, in row 2; and of rows ¢ and y: in row 4. Finally row 2 1s added 
to row 2; and put in row 23; and row ye is added to row yi and put in TOW 
ys, and the extrapolation is completed. The method of coding is straight- 
forward, using C = —1 in operation 3 for subtraction. 

As a third example, consider a numerical integration: according to th 
rectangular approximation. Suppose that the successive values of th 
function at points x are given by fi(x) and that the intervals are unequal 
and Are given inrowi + 1. Then for the rectangular approximation W! 
want to form the product of rows7 and + 1, column by column, and the 
accumulate this product over the columns. Here we make direct use 0 
operation 6 and form 


21 
Y, filwfens(u) 
u=0 


in row i +2. Then the value of the approximation is found in ro 
i + 2, column 21. - 

Description of a Special-purpose Business or Logistics Computer. O 
of the most important features that a special-purpose computer of th 
type should have is rapid, flexible input-output. A general characte 
istic of the problems under consideration in this section is the large m 
of input data to fill the array and the large mass of data computed fro 
the array. This is in fact the main reason for having instructions th 
operate with generalized addresses. The given data may be characte 
ized as items and characteristics associated with the items. Here t 
characteristics are the numbers upon which it is necessary to perfor 
operations, and the items are the specifications of these numbers. Hen 
on the drum the location of a number can represent the item or spect 
cation, while the number itself, i.e., the contents of this location, will 
the characteristics. 

For example, in a naval supply problem an item might consist of t 
set of numbers (o,21,%2,03), where xo = ship number, «1 = nee peri 
(e.g., month and year), %2 = Navy material class (e.g., class 75, di 
engine parts), and x; = source of supply (e.g., the vessel itself , an aflo 
activity, a shore activity). The associated characteristics might cons) 
of the set of numbers (Yo,y1,y2), Where yo = total number of items co 
sumed, yi = total weight in pounds, and y2 = total volume in eu 
feet. The source of this data may have covered 12 ships, during 
time periods, for 50 Navy material classes, and from three sources 
supply, or 129,600 items. For each of these items there appear th 
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xix-digit numbers representing yo, yi, and y2. The data would then con- 
(nin 2,332,800 digits. 

Processing this data might consist in computing and printing tables 
with entries yo, y1, and y2 corresponding to some specified set of the 2’s. 
lor example, the set (%0,a1) would lead to a two-way table of vessel 
umber vs. time period, with entries covering all classes of material from 
ull sources. Or additional functions of the y’s may be desired. Average 
values are about the simplest example: y; = yi/yo = average weight in 
pounds; ys = Y2/yo = average volume in cubic feet; ys = 2,240yo/y1 = 
stowage factor in cubic feet per long ton. 

It often occurs that a characteristic of one problem becomes an item 
specification for another problem, or the reverse. Tor instance, it might 
he desired to compute a table giving the values of yo, yi, and y» for each 
ry, ©, and ys, that is, for each ship, time period, and average weight. 
In such a case the numbers yo, yi, and y2 will have to be relocated so 
(hat xo, a1, and ys will now specify their locations. Hence it becomes 
‘lear that in business and logistics computations searching, sorting, and 
ordering play an important role, in addition to the types of computations 
vonsidered above. 

In the previous section we assumed that the information was recorded 
on au magnetic tape; here, however, we shall consider searching, sorting, 
und ordering of items by characteristics accomplished on an auxiliary 
drum, different from the one with which the computations are performed. 
The input data, which might, for instance, be on punched cards, is read 
onto the searching drum, both the items and the characteristics being 
tecorded. This drum is appropriately searched, and the characteristics 
(lelermined by the search are recorded on the computational-array drum 
ii) positions (i.e., addresses) that specify the associated item. Any 
feurrangement or reordering is accomplished by use of the searching 
drum, 

(ur special-purpose business or logistics computer is thus seen to be 
two interlocking computers, an array-manipulation computer and a search- 
jig or information-retrieval computer (see Vig. 4-12). Each would have 
iin own programming system. The array-manipulation computer would 
lve special-purpose array-manipulation instructions as described above; 
(he information-retrieval computer would have instructions enabling it 
(0 perform the operations described in the previous section. Ideally the 
{wo computers should be able to operate concurrently; appropriate inter- 
lovking conditions would have to be satisfied, so that the two computers 
uid computer programs would be properly synchronized. 

In addition, we could attach a general-purpose computer to this setup 
to have a large-scale system (lig. 4-13), Again, for concurrent opera- 
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Array-manipulation 
computer 


Computational 
array drum 


Information-retrieval 
computer 


One method for solving this problem is to have a special input drum for 
(he computer. Everything that is read into the computer is read directly 
onto this drum just as it appears on the input medium. If the input were 
punched cards with 80 columns and 10 rows, then the data of the cards 
would be read onto the drum just as it appears on the cards, with a 
(nit corresponding to a hole, a zero to no hole. Then a special translating 
computing unit would be coded to interpret this information appropri- 
atcly. The translating unit would interpret the raw information on the 
drum, change it into appropriately coded binary data, and put it into 
(he computer’s main memory. This technique is extremely flexible, since 
wll that is needed is a method for getting the raw data onto the drum, 
in any digital fashion whatsoever. 

‘This naturally brings up a second problem. Since reading in and trans- 
luting the data will be time-consuming, it would be more efficient if the 
womputer could be computing on other data, or even another program, 
soncurrently with the read-in and translation. Such a concurrent-input 
fealure presents no problems other than logical and electronic design, 
provided that certain interlocks are observed. In reading data into a 

















> 






High-speed 
printer output 
Punched-card 

output 







































Information- General- eomputer it is certainly known in which part of the main memory the 
output retrieval hataned information will eventually be stored. A simple type of interlock will 


seinen: halt a concurrent computer program that refers to a main memory address 
(hat has not yet been read into; when the input information is inserted 
jiito this address, the program automatically continues. 

li so far as output from the computer is concerned, the translator acts 
i) reverse, putting information on the special drum in a form which when 
yond out from that drum will correspond to any desired external coding 
scheme. Similarly, with proper interlocks concurrent output can be 
accomplished. 

I'he Large-scale Data Processor. The large-scale data processor as 
described in this paragraph (see Fig. 4-14) is a fictitious computer; its 
(loncription is used here as a vehicle for discussing the problems that arise 
i) multiple-unit system. The purpose of this illustrative computer would 
jw (o perform general data processing. It encompasses each of the three 
foulures described in Sees. 4-3, 4-4, and 4-5, in addition to the translating 
womponent. Hence it is comprised of a general-purpose computing unit, 
4 logical-control computing unit, a searching, sorting, and ordering com- 


Fic. 4-13. Large-scale system consisting of information-retrieval, array-manipulati 
and general-purpose computers. 


tion interlocks must be maintained. More will be said about such 
arrangement in the next section. 


4-6. The Large-scale Data Processor 


Special Input-Output Problems. When data is collected, the reco 
ing scheme used is ordinarily determined from criteria dictated by t 
particular problem, by convenience, etc. Rarely is the best form f 
computer utilization considered as a factor. Also, data produced by 


computer will not in general be directly compatible for input into anot puling unit, an array-manipulating computing unit, and an input-output 
type of computer. These problems become extremely serious in, f (yanslator computing unit. It contains a high-speed memory, a general- 
example, missile-testing programs, aircraft-flight evaluations, ete., wh purpose magnetic drum, a translator input-output drum, a sorting drum, 
exceedingly large masses of data are produced very rapidly. The pro 4) urray-manipulation drum, as well as magnetic-tape units. It has 


sonventional input-output equipment and a signal-sensing input unit for 
foul-time control. 
he main point, however, to be illustrated by this example is that any 


lem of conversion of this data to a form that can be read directly into 
computer for analysis then becomes almost a larger task than colleet 
the data, 





or all of the computing units may be operating (i.e., 
currently, using any or all of the memories or input-output units. 
mechanisms that enable such concurrent operation are found in the in-ou 
selector—concurrent input-output interlock control and in the computing 
units interlock control. 


two possible ways it can operate. 
processor is operating on multiple independent codes simultaneous 
The instructions for each of these codes are found in the memory unt 
At any time, each code is being computed in a different computing u 
Each computing unit has its own instruction register, and its 0 
current-address register that sequences the program according to t 
instruction definitions of the particular unit in which it is loeat 


MACHINE LANGUAGES [CHapP. 


General-purpose 
computing unit 





Searching, sorting, and ordering 
computing unit 
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en eee 
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Logical-control 
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Computing-units interlock control ae | 
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Magnetic-tape memories 














| Typewriter, punched-paper 
nuns | tape, punched cards, 
fo! a a = Ld 
4 Signal sensing input unit 


Fia, 4-14. Large-scale data processor. 







| Typewriter, punched-paper 
tape, punched cards, kl 
magnetic tape, etc. 


ea 
units 







Let us consider the computing-units interlock control first and deseri 
Consider the case where our da 


computing) con 
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‘There are times, however, when one code will want to operate in a differ- 
ent unit, will want to change computing units. These change-unit 
instructions are not handled in the computing units; instead they are 
sensed and sent to the computing-units interlock control. This interlock 
vontrol keeps track of which units are being used. If a code wants to 
vhange to a computing unit that is being used, the code progress is stopped 
until the desired unit is free. A waiting line, or queue, is formed if 
hecessary for each computing unit. The order, or priority, in which 
Waiting codes are selected from the queues can be made as complicated as 
is considered necessary for the purposes of the computer design. 

When the desired computing unit becomes available, the contents of 
(the instruction address register is transmitted to this computing unit 
und the code is continued in its new unit. In other words, a computing 
\init processes the instructions it retrieves from the memory as directed 
hy its current-address register; when changing from one code to another, 
/ computing unit merely needs to have its current-address register 
thanged. The control mechanism for this change is located in the 
#omputing-units interlock control. By this means each of the computing 
\inits can act almost as a separate computer in its own right, except when 
equired to idle by the computing-units interlock control. Even though 
this concept for handling multiple codes at the same time is reasonably 
simple, the detailed techniques in a specific case may present many 
special considerations and exceptions. 

The second way in which a computing-unit interlock control might 
Hperate is to have only a single code being computed. However, it often 
lwppens that different parts of this code can be computed at the same 
lime. That is, the logic or flow diagram of the code may not necessitate 
4 completely serial sequencing of operations, but several parts of the code 
Wight be processable in parallel at the same time. In this mode the 
wmputing-units interlock control scans the entire code, recognizing 
jurts of the code that can be executed independently. It does this by 
ubserving memory reference locations. If one part of a code never refers 
{0 & memory location used by another part of the code, then it can be 
complished at the same time as that other part of the code. The rules 
for observing this can become rather complicated. The computing-units 
\iterlock control must accomplish both functions; how thoroughly it 
ivcomplishes either function depends on the systems design and purpose 
of (he computer. 

linally we consider the in-out selector-concurrent input-output inter- 
loek control. Besides in-out selections, this control unit must accomplish 
Wiput-output interlock control similar to that described above. All 


iput-output instructions are automatically sensed and sent to this inter- 
lock control. If free, the appropriate unit is selected; if not, the code 
Will wait, as was the case in the computing-units interlock control. 
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This interlock control must communicate with the computing-unit 
control, so that a code may be stopped when it wants to use a memor 
location not yet loaded. Again the details can become exceeding] 


complicated. 


4-7. Exercises 


For the digital differential analyzer: 

4-1. Compute sin x and cos x, and graph the results as they would be comput 
(see, for example, Fig. 4-1). If the word length were 4 bits, how would the probl 
be scaled? ; 

4-2. Draw the arrangement of units to compute y = 1/z. 

4-3. Draw the arrangement of units to compute y = In x. [Hinr: dy = (1/2) 
Let w = 1/x; then dy = w dz, and dw = (—1/2?) dx = —w dy.] 7 

4-4, Draw the arrangement of units to compute y = 2”. {Hinr: dy = na" dz 
a{(n/x) dx] = 2*d(nIn x) = yd(n in z).} ; ; ; 

4-5. Draw the arrangement of units to solve the following simultaneous different 
equations: 


dy dys _ 
ou = Aye + ArsYs a = Raiyi + Arsys te Asiy1 + AseY2 

4-6. How can a digital differential analyzer be used in conjunction with machi 
tool control (see Chap. 1)? 


For the real-time control computer: ; 
4-7. How would the external-control instruction break the loop when no compari 


is successful? (Hint: A dummy comparison would automatically compare favorab) 
How would this dummy comparison be arranged, i.e., what would be the first 
second words of the dummy condition?) he 

4-8. Suppose that the input word has the following inputs (see Fig. 4-15): tai 
full, L(1); tank 2 full, L(2); valve 1 open, V(1); valve 2 open, V(2); valve 3 0 
V(8). For the output or control-signal word let O(1) mean open valve 1; O 
open valve 2; O(3), open valve 3; C(1), close valve 1; C(2), close valve 2; and C( 
close valve 3. Define an appropriate instruction, including an external-con 





Valve 2 


Fra, 4-15. Simple system controlled by real-time logienl-control computer, 
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instruction, for the logical-control computer as discussed above, and code the process 
of filling the two tanks, one at a time. 

4-9. Code the problem used in Exercise 4-8 without the use of an external-control 
instruction. 


For the information-retrieval computer: 
4-10. Suppose that a factory item is composed of the following word: 


Number of 
Name of hours Type number of 
worker Date on job completed object 


ee eee eee eee 


Ee ae ae a Ps ae ee | 


where each box can be filled with a digit or an alphabetic character. Define an 
instruction system for a special-purpose retrieval computer as shown in Fig. 4-9, 
und write a code that will determine how many objects of type 54321 were completed 
between Mar. 3, 1959, and Mar. 15, 1959. 








lor the business and logistics computer: 

4-11. Consider the example of computing building-material purchase schedules and 
requirements. Suppose that the requirements at the site for each month during the 
construction are as follows: 












































Type of material 
Month 

0 1 2>)) 33 4 5 6 vA 8 9 10 
March............ 100 75 | 60 | 54 0 0; 20} 60; 10] 20 | 30 
Aprils: cas saieen cs 50 0 | 40 | 60 0 0; 20; 90; 10} 20] O 
May. ....cecseeeee) 25 0 | 30 | 50 0 0} 30] 150 10 | 30} O 
JUNGs ceca ica es 30 | 115 | 20 | 70 0 0} 50; 50); 50 0 | 20 
VU soci cs cent 0 0} 10 | 30 10} 20-| 90}; 20] 70 0] 10 
August............ 0 | 100 5 | 50} 50! 301 100 10; 90] 50] 5 
September. ........ 0 0; 0} 90 | 100 | 100 |} 150 0! 100} 90} O 
October........... 0 0} 0} 20 0 | 150 | 200 0 | 150} 100 | O 





Nuppose that the delay time and the cost per unit are as follows (where the cost per 
nit of the material varies with the number of units ordered as shown): 

































Type of material 
0 1 2 3 4 10 
. AE } oes 
Delay time, 
months...... 3 5 6 2 4 7 
Units ordered: 
0-0 150 
10-19 120 
20-49 110 
50-00 100 
100 or over 
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Determine the schedule for the ordering, and determine how much money will be 
spent each month, assuming that the orders are paid for at the time they are received. 

4-12. Write a code that will compute Exercise 4-11. 

4-13. Write a code for numerical integration by the trapezoidal rule. 

4-14. Write a code for linear extrapolation. 

4-15. What situations might require interlocks between the information-retrieval 
computer and the array-manipulation computer? 


For the large-scale data processor: 

4-16. Draw a flow chart of a code that would interpret raw data from cards. Sup- 
pose that a computer word contains 15 bits, with the decimal point to the right. 
Assume that the raw data appear on cards with 80 columns and 10 rows, where each 
card is interpreted as containing 20 four-digit numbers and the digit associated with 
each column is the row location of a single punch in that column. For example, the 
first two numbers on the card illustrated in Fig. 4-16 are 1,340 and 1,235. Assume 




































































Fia. 4-16 


that the translator drum records the rows of each card in sequence, from 0 to 9. The 


code is to compile the successive numbers (i.e., successive groups of four columns) 
in binary in the main memory of the computer. 
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PART 2 


AUTOMATIC-PROGRAMMING LANGUAGES 


CHAPTER 5 


DEVELOPMENT OF AUTOMATIC PROGRAMMING 


6-1. Introduction 


Part 1 of the book was concerned with the details of programming and 
coding at the instruction, or machine-language, level. The four-, three-, 
\wo-, and one-address instruction formats were described and compared. 
The octal-coded binary and decimal-coded binary systems were com- 
pared. The steps in preparing a flow chart, symbolic code, and finally 
the code itself written in the actual machine language were described. 
Decision branches, instruction modification, and recursion codes were 
discussed and examples given. Coding was found to be detailed and 
luborious, and the one-address system particularly aggravated these 
(qualities. 

Advanced programming techniques attempt to lighten the load of 
(he programmer and coder. Their purpose is to try to make the com- 
puter itself help prepare the program or code, minimizing the amount of 
writing a programmer need do. The means used to accomplish this 
in fo prepare or precode the computer with an intermediate, or automaizc, 
program which is always available in the computer memory. This inter- 
mediate or automatic code encompasses certain aspects of coding that 
are common to many programs and of general applicability. Then, 
when the programmer is coding any specific problem, his work can be 
shortened by taking advantage of already precoded items of the automatic 
program that are already inthe computer. Such automatic programming 
vin greatly simplify the coding of computers otherwise difficult to code. 
Unfortunately, however, the coder must then learn how to use not only 
(he computer but also the automatic-programming languages. 

Rarely is a program written that is found to be absolutely correct 
(he first time it is run on the computer. Hence in this chapter we first 
consider automatic aids for analyzing an already coded program for 
errors, Next automatic programs designed to lighten the load of the 
programmer are considered, These can be roughly classed in two cate- 

oy 
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gories—the interpretive routines and the compiling routines. Various 
examples are discussed as illustrative of the methods and techniques of 
formulating and using such automatic programs. The specific examples 
given are intended to illustrate the principles involved, and their particu- 
lar form as chosen for this chapter was dictated by pedagogical consider- 
ations only. As will become clear during the course of the chapter, 
there are many other ways to solve the problems posed, and in actual 
practice the final choice depends on the primary purpose of the code and 
the detailed characteristics of the computer to be used. 


5-2. Program Debugging Methods and Routines 


The Problem of Program Debugging. The first time a program is 
tried on a computer, it usually does not run correctly. If the start 
button is pushed and the program allowed to run, one of four results 
can oceur: First, the computer might run up to a point and then just 
stop, or “hang up.” Second, the computer could continue to run for an 
excessively long time, neither stopping nor producing any results. Third, 
the computer could run and, as expected, print out results, but wrong 
results. And finally the program could run correctly. In the first three 
cases an error is indicated in the program, and the problem is to detect 
where—at what instruction or instructions—the error occurs and then 
to correct it. This procedure is often referred to as “debugging” a 
program. 

It is hard to describe in general how to debug a particular program; 
ingenuity on the part of the programmer, systematic and neat flow 
charts, coding sheets with detailed remarks, and a thorough knowledge 
of the code are, of course, all essential. When a program just stops, this 
usually indicates that the computer has been asked to do an illegal 
instruction; for instance, the instruction might have an operation code 
not corresponding to any operation or might contain an address that the 
computer does not have, etc. Illegal instructions are often the result 
of an error in instruction modification. When a program never stops, the 
program usually has a loop with a faulty exit. The exit may not have 
been properly set up, or else a nonconverging iteration may be in process, 
ete. The printing out of wrong results requires examination of these 
results in the light of the problem: Is the scaling wrong? Are the 
constants correct? Has any phase of the calculation been skipped} 
etc.? 

However, certain techniques can be discussed that are often used to 
aid the debugging process. These can be generally classified into dynamieé 
debugging and post-mortem debugging. Dynamic techniques try to 
check the program while it is actually running on the computer; post. 
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mortem techniques examine the computer memory after the program has 
stopped, or “‘hung up.” 

Dynamic Program Debugging. Let us first consider dynamic program 
debugging. There are two generally used methods: the first is called the 
break-point (or ckeck-point) method; the second is called the automonitor 
method. The break-point method consists in writing into the program, 
at critical intermediate points, instructions that either will stop the 
computer so that important intermediate results can be observed or will 
make the computer print out such intermediate results automatically. 
The sign bit on an instruction word is often used to facilitate such a 
process. For example, some computers are designed so that after 
executing an instruction with a negative sign the computer will stop. 
Usually such computers also have the following additional feature: A 
manual switch on the control panel can be thrown so that in one position 
the computer will not stop after a negative instruction—this being called 
the normal switch position—and so that in the other position the com- 
puter will stop—this being called the break-point switch position. The 
advantage of this control-panel switch is clear: in debugging a program, 
the negative instruction stops are used, and the switch is set accordingly; 
when the program has been corrected, these negative stops are no longer 
desired and hence the switch is thrown to normal. 

Another break-point procedure depends on the programmer’s inserting 
into the program at intermediate critical points manual-switch jump 
instructions. In debugging the program, the switches are positioned 
xo that the computer will jump when it encounters such an instruction; 
usually special print-out routines are included in the program, and the 
jump instructions transfer the computer to these routines. When the 
program has been corrected, the switches are turned off and the com- 
puter simply ignores the jump instructions. 

Use of the Automonitor Program. The automonitor method for 
dynamic debugging does not require any additions or special considera- 
(ions in the program itself, as does the check-point method. | Instead the 
wutomonitor method requires the use of a special automonitor pro- 
gram which is read into the computer in addition to the program to be 
debugged. The automonitor program is general and once written can be 
ned for every program to be debugged. Occasionally the computer 
lus the properties of the automatic automonitor program already wired 
in, so that the automonitor process can be carried out simply by throw- 
ing a switch on the computer console. An automonitoring program 
inakes the computer print oul—while each instruction of the program 
being debugged is being executed—the instruction itself, the contents of each 
of the operand addresses, and the computed result. For example, suppose 
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the program to be debugged is as follows: 
oo eS a ee 





Address Instruction 
5o Operation | ao Bo Yo 61 
1 Operation | a1 Bi oat 52 
55 Operation | ae Bo 72 53 


ee ee Nee he ee ed) 
Then the automonitor program would have the computer print out the 


following: 
oo 
Address | Instruction | 1st operand | 2d operand | Result 





50 (60) (ao) (Bo) (yo) 
5 (61) (a1) (Bi) (vw) 
62 (62) (a2) (B2) (y2) 


as ee ee ee 

In other words, the automonitor program gives a complete record for the 
program being debugged of just which instructions are being done by the 
computer and what the results were. With this information it becomes 
easy to spot any error in the program being debugged. Note that the 
automonitor program is a program that operates on another program— 
more about this point will be said in the succeeding sections. 

Of course certain instructions of the program being debugged are 
handled somewhat differently. For example, in a compare instruction, 
(y) need not be printed, but just 6, (5), (a), (8), for the address of the 
next instruction tells which way the comparison went. Common sense 
clearly dictates what an automonitor program should print out. 

An automonitor program for a three-address computer would be 
similar to that just described for a four-address system. For a two- 
or one-address system the contents of the accumulator or other registers 
become of importance, and hence the contents of the accumulator is 
printed out. For instance, in a one-address system, the print-out would 


look like this: 


59 (50) (ao) (ace) 
5: (51) (a1) (ace) 
5s (52) (a2) (ace) 


The Automonitor Program Itself. In order to grasp more fully the 
general concept behind an automonitor program, let us examine in & 
cursory way how such an automonitor program itself might be writte 
for a one-address system. The flow chart is given in Fig. 5-1. Th 
automonitor program is in one part of the memory, and the program to 
debugged is in another part of the memory, The automonitor progra 
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must be given the address of the first instruction of the program to be 
debugged. Then the automonitor program prints out the desired record 
and also executes the instructions of the program to be debugged. First 
the automonitor program prints the 
address of this first instruction, 69; 
then it copies (69) into a special tem- 
porary located in the automonitor 
program’s partofthe memory. This 
instruction is printed, and (ag) is 
printed. Next the automonitor pro- 
gram directs the computer to execute 
the first instruction in its location 
within the .automonitor program. 
Then (acc) is printed, and the 
wutomonitor program goes on to 
the next instruction, etc. There 
are a few important details that 
must be attended to. When the 
program to be debugged is being 
executed, the contents of the accu- 
mulator from instruction to instruc- 
(ion is, of course, important. But 
when the automonitor program exe- 
Gules one instruction at a time and 
inserts print routines, etc., between 
(hese instructions, the contents of 
the accumulator becomes lost. In 
order to correct this situation, the 
wontents of the accumulator (and 
other registers) must be recorded 
alter the execution of each instruc- 
lion of the program being debugged. 
Then, before the next such instruc- next instruction 
Lion isto be executed, the registers are : 
properly preset.t In this way the two eee a ae as 
programs are properly interleaved. 

l’ost-mortem Program Debugging. Consider next post-mortem program 
(debugging. The first method is to run the program through and observe 
(he results. From a knowledge of the program and the erroneous results 
i error can often be located. For example, if the magnitude of the 





Copy (6) into temporary 


Print (5), i.e., as it appears in 
temporary 


Print (a) 
Reset accumulator and registers, and 
execute the instruction in the temporary 
Record (acc) and (registers) 














{| Also, provision is made in the automonitor code so that jump instructions are not 
wrformed directly but rather operate on the automonitor 64-address reading 
fetruction, 
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results is off, the scaling is checked for error, etc. The second method is 
to print out the entire memory of the computer after the program has 
been run. If the program got into a loop with no exit, then the tallies 
can be checked on this memory print-out. All instructions that were to 
be modified by the program can be checked to see whether or not they 
were correctly modified, ete. A third method is to have another program 
that examines the program to be debugged after it has run. This pro- 
gram compares the program to be debugged with itself before and after 
it has been run and tells the programmer where they differ. In this way 
the programmer can determine if any instructions were inadvertently 
destroyed, if all differences that should appear actually do appear, etc. 

Of course the programmer uses combinations of the techniques that are 
at his disposal. What we have briefly outlined here is in no way come= 
plete but rather indicates some of the directions from which aids to 
programming may arise. 


5-3. Mathematical Interpretive Routines 


Automatic Programming. There are two general kinds of automatic 
programming, called interpretive routines and compiling routines. The 
purpose of automatic-programming techniques is to reduce the number 
of instructions actually written by the coder. The method for accom- 
plishing this is previously to place in the computer a library of subroutines 
that can be used by every coder, so that these routines will not have to be 
rewritten each time they are needed. A set, or library, of particul 
subroutines, together with a method for using them, is called an auto 
matic program. The difference in the two kinds of automatic progra 
lies in their use: in the interpretive program the subroutines are to 
used by the coder’s program while it is running on the computer, where 
in the compiling program the subroutines are used by the coder’s pr 
gram only at the time the coder’s program is read into the computer. 

We have arbitrarily classified interpretive routines into three types 
(1) mathematical, (2) simulational, and (3) data handling. The su 
routines of the mathematical type of interpretive routines compu 
values of functions, those of the simulational type are usually designe 
to make one computer look like some other kind of computer, and tho 
of the data-handling type assist in organizing large amounts of data an 
in data searching. 

Use of Mathematical Interpretive Routines. Let us first consider 
example of a mathematical interpretive routine. The purpose of su 
a routine is to enable a coder to form such functions as sin w, COs x, e”, # 
x/z, etc., merely by writing a couple of instructions. Of course the 
functions would appear as subroutines already in the computer, as p 
of the interpretive routine, However, the interpretive routine must al 
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contain a section that can interpret the instructions that the programmer 
writes. When a function subroutine is to be used, the following infor- 
mation must be recorded: (1) the address that contains the value of the 
independent variable 2; (2) the address into which the result f(x) is to 
he put; (3) the address in the program from which the jump to the sub- 
routine was made, so that after the subroutine has been campleted the 
computer will know where to jump back to the original program; and 
(4) the address to which to jump in order to go to the subroutine itaelf 
( onsider a two-address system: two words are needed to go to a sibs 
routine (since four addresses are required). We could arrange the words 
us in Fig. 5-2. The first word is a return jump instruction; the second 


Operation a B 






Return jump /Standard address| Address of Ist instruction of fune- 
tion subroutine 





Address of x Address of f(x) 


Fig. 5-2. Instruction and address-data word for interpretive routine. 


word is not an instruction but is just to tell the subroutine where z is and 
where f(x) is to be put. Recall that the return jump instruction jumps 
{o 8, and in addition the instruction memorizes as the contents of « i 
own address. Hence the return jump takes care of items 3 and 4 above 
wind the second word of items 1 and 2. Thus the programmer, each time 
he wants to use a subroutine listed in the interpreter library koe eamnte a 
finet ion, need write only the following “instructions’’: (a) a return jump 
innt ruction, where a@ is a standard address (standard, that is, for this par- 
Heular interpreter) and where @ is looked up in a book of faehnictions for 
the interpreter which lists the proper 6 for each function; (b) the word 
following the return jump, as described above. This has illustrated the 
procedure for using a mathematical interpreter. 

I'he I nlerpretive Routine Itself. Now let us consider the interpreter 
program itself. Besides consisting of the function subroutines, as we 
ubserved above, it must also consist of a part that interprets the two 
words that the programmer writes. This part looks at the standard 
uddress to find out where the address of the return jump is. Knowing 
(hut the next word gives the locations of x and where f(x) should be put 
{his part of the code then relates that information to the mapeaprinte 
mibroutine and transfers control to this subroutine. This part can be 
thought of as a subroutine also; we shall call it subroutine Q. The flow 
(ingram might appear as in Fig. 5-3, where, to be specific, suppose that 
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Mathematical interpretive routine 





In subroutine Q set R=Ry, f(x) = sinx 






Part of program ——s 
that will make use 
of interpretive 
routine 


Return jump 
(to A) 








Subroutine Q 













Appropriately 
modify subroutine 

















3 for f (x) according 

5 Iz+1| to data word; 

3 Ao set exit B of 

: (B:) subroutine f(x) to 
jump to Z+ 2 





SSS Sees 
SS a 





Main part of subroutine f(x) = x! 


In subroutine Q set R= Rs, f(x) = Vr 
Main part of subroutine f(x) = “Vx 


Fig. 5-3. Flow chart of interpretive routine. 





it is desired to use a cos x subroutine. The long dashes indicate t 
path of the computer through the program and subroutines ; the sho 
dashes, the operating of one subroutine on another subroutine. We | 
7 be the address of the return jump instruction; then Z + 1 is the add 
of the data word, and the program should resume at address Z + 2. 
As an example of the type of things that occur in the subroutine 
consider how it inserts the value of a into the subroutine for f(a). Su 
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routine f(x) has a certain cell reserved for the value of 2; call this cell 
1’, for “temporary for x’; within subroutine Q is an instruction that 
(transfers the value of x to T, from the address given by the data word 
of the program. Suppose that the instruction were ‘Transfer (a) into 
6.” Then Q must have a previous instruction that puts into the a address 
of the transfer instruction the address of 2 as given by the data word, 
i.c., an extract instruction that takes the a address of the contents of 
Z + 1 and puts it into the @ position of the transfer instruction. But 
(his extract instruction must involve the address Z + 1, which is obtained 
from the standard address associated with the return jump. Hence 
(here must be previous instructions that take Z as given in the standard 
uddress, add 1, and insert this into the a part of the extract instruction. 
Next consider the 8 part of the z-to-T, transfer instruction: this must be 
some address specifically reserved in the subroutine of f(x) for the value 
of w. Since subroutine Q is a general subroutine, it must be told this 
uddress each time a subroutine for f(z) is to be used. This is done in the 
part of the subroutine for f(x) denoted in Fig. 5-3 by “In subroutine Q 
wel R = Ro, f(x) = cos x.” In this part there is an instruction which 
extracts the address reserved for x into the transfer instruction of sub- 
routine Q. This process is valid since the computer will pass through 
(his instruction before getting subroutine Q. Summarizing, there is 
(i instruction in the subroutine for f(x) that sets 6 of the transfer instruc- 
lion; and there is a series of instructions preceding the transfer instruc- 
lion in subroutine Q that leads to the setting up of the a address of 
the transfer instruction. 

Our example of a mathematical interpretive routine has several inter- 
oling features: First, one subroutine uses another subroutine—i.e., each 
subroutine for f(x) uses the subroutine Q. Such a feature may occur 
ii other ways also. For example, in evaluating cos x, suppose that the 
power series were used: 


a an 
cos « = > (—)" Goi 
n=0 


Then the subroutine for cos x will use the subroutine for a*/x!, where 
' © 2n in this case. In fact, to use the subroutine for a?/z!, the same 
lochnique of the return jump and data word is used (why?). Of course 
the use of subroutines by subroutines can be carried deeper than two or 
three, Second, note how many instructions are actually needed to 


perform the simple transfer described above. The moral of this is that 
poding a simple job can be complicated on a computer. This, indeed, is 
{he reason that it pays to have a separate subroutine Q rather than to 


fepeat the instructions of this subroutine for each f(x) routine. Finally, 
the main purpose of this mathematical interpretive routine is to make 
the coding problem easier by allowing only two words to be written when 









































168 AUTOMATIC-PROGRAMMING LANGUAGES [Cuap. 5 


a function is to be evaluated. Naturally the process described here was 
greatly simplified, since many other complications may arise—telling the 
f(x) subroutine to what accuracy the function is to be calculated, taking 
care of the proper scaling, etc. 


5-4. Simulational Interpretive Routines 


Use of Simulational Interpretive Routines. Next let us consider simu- 
lational interpretive routines. The purpose of these routines is to make 
one computer appear as an entirely different computer. For example, 
suppose that we had a computer that did not have floating operations. 
By means of the proper routine we can make this computer appear to be a 
computer that has built-in floating-point operations. Or we can make a 
one-address computer appear to be a two- or three-address computer. 
Another example might be the making of an ordinary computer into @ 
special-purpose war-gaming computer. Of course the coding for the 
simulated computer will be quite different from the coding of the actual 
computer. However, the purpose of the simulational interpretive rous 
tine is to enable the programmer to write his code in the language of t 
simulated computer; he need never know that this is not the actual 
method of coding for that computer. Obviously, then, the interpretive 
routine must interpret every instruction of the programmer’s simulation= 
language code. Thus the interpretive routine looks at the first instrucs 
tion of a simulation-language code, interprets it, then goes to the ne 
instruction of the simulation-language code, interprets it, etc., for eae 
instruction passed through in the simulation-language code. Immedi 
ately we see that the interpretive routine must also keep track of whay 
instruction of the simulation-language code to take next; i.e., the inte 
pretive routine must contain a simulated-instruction address counter, 
simulated “simulation-language” instruction register, simulated decoder 
simulated accumulator, etc. In other words, the simulation routi 
must perform all the functions that the electronic circuitry would per 
form if the simulated computer had been real. 

The Interpretive Routine Itself. For the sake of illustration let w 
discuss a simulation interpretive routine that makes a two-address no 
floating operation computer appear like a one-address floating operatio 
computer. To be specific, suppose that there are 6 bits in the operati@ 
and 9 bits in each of the two addresses of the real computer. Suppo 
also that in the simulated computer there are to be three octal symbo 
for the operation and five octal symbols for the address. Of cour 
since the real machine can address only 2° memory cells, even thou 
theoretically 2! memory cells can be addressed by the simulated 0 
address instruction, only 2° memory cells are available, Hence th 
octal positions in the simulated-instruction format are meaningle 
However, one could interpret these as addressing cells on magnetic ta 
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In so far as the simulated numbers are concerned, the rightmost 
two octal positions of a word will represent the exponent, the rest of the 
positions the value of the number. These formats are then as follows 
where each wide box represents an octal digit (i.e., three binary bits): 


ete. 


















































~_ 
Opera- 6 
tion a B & 
—— ——_————. a 
Instruction | | 
Real computer Number 
a ————— eT 
Number | if 
Opera- 
tion a 
Instruction | | 
Simulated computer Number aay 
e—e—_—— 
Number | 





Let us suppose that the real computer has the two-address operations 
linted in See. 3-1, as well as the following additional instructions: 


Code Description of operation 


0 Shitt (acc) left @ positions; put result into 8 
it Shift (acc) right @ positions; put result into B 
v0 Extract (a) into 8, using (acc) as the mask; ie., replace those bits of (8) with 


corresponding bits of (a) in those positions where (acc) has units, and put 
result into B 


Suppose that the one-address instructions to be simulated are these: 
Code 


OO1 
002 
0038 
004 
005 
006 
007 


Description of operation 


Add (a) to (ace), put result in acc, floating unnormalized 
Multiply (a) by (acc), put result in acc, floating unnormalized 
Subtract (a) from (ace), put result in acc, floating unnormalized 
Divide (acc) by («), put result in ace, floating unnormalized 
Conditional jump: if (acc) is negative, take (a) as next instruction 
Transfer (acc) into @ 


Jump: take (a) as next instruction 


Of course the accumulator referred to in these instructions is the simu- 
juled accumulator. To be more specific, let address 776 be the simu- 
juted-instruction address counter, 775 the simulated instruction register 
774 the simulated accumulator, 

‘The programmer will now code in terms of these simulated instructions 
only (never writing down any of the real instructions). It is up to the 
interpretive routine to sequence through these simulation-language 
jnetructions, interpreting them as it goes along, Obviously the inter- 
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pretive routine is written in the real machine language. Figure 5-4 
is a general flow diagram showing how the interpretive routine sequences 
the computer through the simulation-language instruction code. The 
student should study it carefully. We shall first discuss some of the 
details of box 1 of this flow diagram. 
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Transfer the next simulation 
instruction into simulated 
instruction register 


Decode the simulated language 
instruction, i.e., execute the 
simulation instruction 


Is the simulation instruction 
a comparison or a jump? 


Advance the simulated 
Instruction counter by 1 


H 


Change the simulated ~ 
counter to proper 
simulated address 


¥yq. 5-4. General flow diagram indicating how interpretive routine sequences the com= 
puter through the simulation-language code. 


The heart of box 1 is a pair of instructions that transfer the prope 
simulation-language instruction from its location in the code to the 
simulated-instruction register. Preceding this pair of instructions are 
instructions that set up this pair to operate on the proper addresses. In 
symbolic code, the pair might look like this: 






Operation a B 


. | 
Address of a constant of all Address of proper simula- | 
zeros, 1.e., tion-language instruction 







Address of first instruction 
of box 2 



















Simulated-instruction regis- 
ter, ie., 775 


Transfer, 
ie., 52 
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lhe address of the proper simulation-language instruction is found in 
the simulated-instruction address counter. Hence the contents of the 8 
part of this word (i.e., the contents of 776) must first be extracted into 
the 8 part of our add instructions: 








Operation a B Remarks 
se 1.e., ¢ Address of a constant that has all | Set up mask 
53 units in 8 position and zeros else- 
where 
i 776 | Address of above add instruction | Do extraction 
i.€., 











Suppose that we put the first instruction i 
of box 1 into ad ; 
altogether for box 1 we would have pepe 








Opera- 
Address on a B Remarks 
600 53 604 605 | Set up mask 
601 20 776 602 | Do extraction 
602 53 604 bite 
603 52 775 | 606 } sioner 
604 00 000 000 | Zero constant, i.e., 
605 00 000 777 ‘|| Mask 


























, ear of the simulation-language operations is performed, of course, 
ry a su routine contained as part of the interpreter routine. Hence box 2 
consists of these subroutines, in addition to a jump table that looks at 
the simulation-language operation and jumps to the proper aibnedene. 
Che shift instruction would be used in this procedure. The simulations 
lunguage instruction is shifted to the right 17 (octal) positions [15 (deci- 
mal), putting the operation code in the 6 position, leaving zeros else- 
where. Then this shifted word is added to the draiafet instruction 
(hat will jump to the proper jump instruction of the jump table. In thi 
way the simulated operation code is used to enter the proper Shek ine 
from the jump table. Note that first the transfer instruction that jum ‘ 
\o the table must be reset, for it becomes modified each time it iG “ 
Hence, in the first part of box 2, we have the code of Table 5-1 Wh 
i type 52, or transfer, instruction is used as a jump, the a aildréss jac 
ot have much meaning. Hence we always put 777, the address of fhe 


real accumulator for 4 

/ , a, to ensure no change due to thi pa p 
‘ / 8 I if of t he Lm 
jniwtr uction, ai 




















ee 
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Let us examine briefly the subroutine for the ee 
multiplication, floating unnormalized, as an example of our ear 
subroutines. The purpose of this subroutine is to multiply («) by : ; 
where a is the address found in the contents of 775, the simu a 
instruction register (see the instruction and number formats sd id 
simulated computer above). The result is to be put into 774, the 


TapLe 5-1. Cope ror Box 2 or Fra. 5-4 


fa ee 














OpStA- Remarks 
Address aan a B 
606 53 604 610 Put jump constant into Reset jump 
accumulator instruction 
607 52 614 611 Put constant into 614 ; ; 
610 52 777 614 Constant used to reset jump instruction ° : 
611 53 604 775 Shift simulated-language instruction to 
612 31 017 777 } right 17 (octal) positions in accumulator 
613 51 614 614 Add shifted instruction to jump instruction 
614 52 777 614 Jump instruction 
615 52 777_~«| Add 
subroutine 
616 52 777 | Multiply 
subroutine 
617 52 777~=~*| Subtract 
subroutine 
on we air. | Dede : Jump table that sends control to proper 
subroutine simulated operation subroutine 
621 52 777 ~+| Conditional 
jump 
subroutine 
622 52 777 ~+| Transfer 
subroutine 
623 52 777 +‘| Jump 















































subroutine 


simulated accumulator. In order to perform this miiltip lostioaaay 
through P24+ of each word are to be multiplied, while P1 throug ol 
each word are extracted into initially empty temporaries, er ae 
are added, and the result extracted into P1 through P6 of 77 4 
performing the addition, P7 of the sum must be checked for ova 
the exponent of the result cannot be larger than six poe a 
performing such a subroutine the computer would go to box 3 0 


general flow chart. 


5-5. Data-handling Interpretive Routines 


Use of Data-handling Routines. The manipulation of large masses 
data can become a serious problem to the programmer. Such data mu 


t See Sec, 1-8. 


Alion, or loop. 


each characteristic is followed by a list of its associated items. 
the first, or label, word of each item list will denote the characteristic 
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frequently be interrogated or searched, to find or insert specified items. 
Also, reasonably efficient organization of the data must be maintained in 
order that memory space not be unduly wasted. Hence data-handling 
routines which are designed to assist in such processes must be able to 
(1) search data, (2) relocate data, (3) insert additional or new data, and 
(4) omit obsolete or old data. In this section we shall consider two basic 
organizational concepts for formulating interpretive routines to assist in 
the programming of such tasks: the first can be visualized as a bookshelf 
organization, where the data is searched and moved around in a manner 
akin to the handling of books; the second involves the use of ‘‘threaded”’ 
data components, or lists where conceptually contiguous data components 
may physically be spread diffusely throughout the computer’s memory, 
but where each data component gives the address of the next successive 
component. (We might suggest at this point that, for a fuller under- 
standing and appreciation of the utilization of data-handling techniques, 
the student refer to the discussions on searching, sorting, and ordering of 
Chap. 11.) 

The Conventional Search. Data is basically organized in terms of lists, 
such as inventory lists, billing lists, bibliographical lists, subject word 
lists, and so forth. Suppose first that our collections of lists are recorded 
on magnetic tapes, and recall from Chap. 3 that magnetic-tape read-in, 
read-out, and tape-moving instructions usually specify the number of 
hlocks to be read in, read out, or moved. Hence only multiples of blocks 
vin be handled. Therefore, in recording a list on a section of magnetic 
(upe the list is usually padded at the end with words of all zeros so that 
(he entire list will make up an integral number of blocks. The first word 
of the first block of the list is often used as a labeling word: it contains 
information about the contents of the list, and, most important for our 
discussion, it tells how many blocks long the list is. The magnetic tape 
will then consist of a sequence of such lists, each labeled with the number 
of blocks in its length as the first word of the list. 

There are two advantages to labeling the lists: (1) The computer will 
ul first observe the label word of the first list. If this list is not the one 
(losired, the computer can then move the magnetic tape by the number 
0! blocks indicated in the label, past this list to the next list; the label of 
(he next list will be observed, and so forth. In this way the computer 
iiuy jump over parts of the magnetic tape not of interest during the 
particular search. (2) When a list is of interest, then each word of the 
lint must be examined; this is usually accomplished by some kind of iter- 
The tally of this loop can be set according to the length 


of the list to be examined, as given in the label word. 


lor example, in the second method for searching given in Chap. 11, 
Hence 





ee 
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and the length of the list, say in the a and @ positions for a two-address 
system. Then the computer will look for the first of the given (ordered) 
characteristics by jumping over lists until the appropriate list label is 
obtained. 

A similar process can assist the searching of lists recorded in the high- 
speed memory of a computer. Again the lists are labeled, and again the 
label contains information about both the contents of the list and how 









Start with 
first list 








Compare "types" 


Main 
program 


L———----~-] 


Unsuccessful 
comparison 


Successful 

f Fi comparison 

Pseudoinstruction 
"search" 









Add length of 
current list to 
index register 


Rest of 


main program 





Jump to front 
of next list 


Fic. 5-5. Illustration of use of interpreter routine for the pseudoinstruction “Search.” 


many words there are in the list. The computer can then observe the 
label of a list, and if it is not the desired list, the address of the label 
word of the next successive list can be obtained by, say, adding the length 
of the current list to an index register. The advantages of the label are 
the same as discussed for the magnetic-tape case. 

If a convention is fixed for the format of the label word of a data list, 
then an automatic interpreter routine can be written that will be able to. 
interpret a pseudoinstruction search. For example, suppose the “label” 
is to consist of two words at the front of the list, the first giving the ty 
of items in the list and the second giving the length of the list. A sear 
instruction would contain the type of the list being searched for: t 
interpretive routine would compare the “type” specified in the instru 
tion with the “type” of a list; if the “type” did not mateh successfully 
then the computer would add the length of the present list to an inde 
register and jump to the next list; if the “type” of the list were t 
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ease ee then the interpretive loop would be exited and the com- 

puter would proceed with the next instruction in th i 

ates in the main program (see 
Of course the searching process might be more complicated. Two such 

additional complications will now be discussed. First, after a desired list 

is found, it may next be desired to find a certain sublist of this list. 

Another search instruction would then be in order, but care must be 


Set tallies K and J 
Set j=1 
k=l 






Fia. 5-6. Flow chart to record common numbers on two ordered lists 


taken in the definition and interpretation of such a sublist search pseudo- 
in struction that the sublists of the list are searched. Similarly the search 
of the original list should skip consideration of sublists. A simple method 
for avoiding confusion between list and sublist searching is to include in 
(he label of a list or sublist an identifying character that can be sensed 
by the pseudoinstruction interpreter routine. 

I he second complication to be considered here is that finding the 
desired list usually does not end the search. Frequently a next ste 
(nee Chap. 11) involves the basic operation of the comparison of re 
ordered lists for common numbers. Hence we can have another pseudo- 
instruction for performing this task. Suppose that we have two ordered 
lists, with AK numbers in the first list, indexed by k (k = 1, 2 K) 
and J numbers in the second list, indexed by 7 (j = 1 D ae J). 
A detailed flow diagram of a code to record every seeialidesbbok: is 
common to both lists is shown in Fig. 5-6. Uf the number in the 4 list is 
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greater than the number being compared with it in the k list, then the 
code moves along the k list until a common number is obtained, or until 
a number in the k list is reached that is greater than the one in the j list. 
Then the code moves along the j list in a similar way, until the & num- 
ber is equaled or exceeded. This process depends basically on the fact 
that the j and k lists are ordered. 

Close Up and Make Space. The relocation of data lists, the insertion 
of new lists, and the omission or erasing of old lists are all important 
parts of data handling. If the lists are recorded in contiguous parts of 
the memory or tape, then the handling of the data must be accomplished 
so that the components of a list remain in adjacent memory addresses 
and so that spaces are not inadvertently formed between lists. Two 
pseudoinstructions that are useful 
under these circumstances are ‘‘Close 
up space” and ‘Make space.” For 
example, if a new data list is to be 
inserted between two lists already 
| stored in the memory, these already 
[ | stored lists must be separated in 

order that a number of words equal to 
the length of the new list be made 
available between them; here the 
make-space instruction would be utilized. This of course means that all 
the lists after the new list will have to be “moved over” to accommodate 
the new list. Similarly if an old list is to be erased or omitted, the 
close-up-space instruction will be used to move over the rest of the remaine 
ing lists to fill the space vacated by the omitted list. The relocation of & 
list utilizes both these pseudoinstructions, for first space must be made in 
the new location, and after the move, the vacated space must be closed up, 
These processes can be visualized as being analogous to making space for 
a new book on a bookshelf, moving the books together when a book if 
taken out, and rearranging books on the bookshelf (see Fig. 5-7). 

The pseudoinstructions must of course utilize the address of the first 
word of the list to be erased, or the first address of the location wheré 
the list is to be inserted. The label word of the list itself tells how many 
addresses are to be involved. The interpretive routine must first deter= 
mine these two items of information before performing the move. ‘The 
interpretive routine, however, must know when the moving job has beet 
completed. This is accomplished by using an end-of-data word, analogous 
to the book end of our bookshelf analogy. To close up space, each dat 
list is moved over until the end-of-data word is sensed; then this wo 
also is moved over, and the process is terminated. ‘To make space, t 
interpretive routine searches for the end-of-data word, moves it over t 














Fic. 5-7. Analogy of book shelf with 
data-manipulation processes. 
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appropriate number of addresses, and then proceeds to move the data 
lists over until the new space appéars in the proper location. 

Threaded Lists. The contiguous-component list structure which we 
have been discussing has two disadvantages. First, in order to make or 
close up space, all the lists after the space must be moved. Second, there 
was no means for dealing with the components or elements of the list. 
The concept of threaded-component list structure does not have these 
disadvantages. In its simplest form the idea is that each word which is 
«. member of a list has as part of its contents the address of the next word 
in the list, in a manner similar to the instructions of the four-address or 
one-plus-one-address systems. Of course a collection of words can be 
associated with the list word as a label. In addition to the address of 
the next word on the list, the list word may include in its contents a 
“ype” designator, which identifies it as being associated with some type 
“value.” Then special instructions can be devised that will look through 
u list until a list word of a specified type value is found. A particular 
threaded list will not necessarily have its components in adjacent mem- 
ory addresses but will generally have its components spread diffusely 
throughout the memory. 

The basic concept, then, is to have the memory entirely organized in 
terms of lists, including a list of the as yet unused memory locations. 
‘he interpretive pseudoinstruction system of the computer would never 
explicitly deal with addresses, but with lists and type values. Address- 
ing would be accomplished automatically by means of the list words. 
The exact location of any item in the memory would, in concept, not be 
explicitly known; but the item could be extracted from the memory by 
ineans of list processing. 

The use of threaded lists and sublists involves a notation enabling 
lraces to be made through sublists of lists. As an example of such a 
jotation for threaded sublists, let the words of the sublists be composed 
of three parts. The first part gives the address of the next word on the 
lint under consideration; the second part gives either the first address of a 
wublist that branches at that point or the ‘type’ value if no branching is 
(0 occur; the third part is, say, the character HE if the word is the end 
word of a list or sublist, B if a branching point occurs at that word, and 
(' if neither of these two cases occurs. It should be noted that, if the 
word is an end word, then by convention the first part is to give the 
uddress of the first word of that sublist so that the main line can be 
wontinued. For instance, Fig. 5-8 illustrates a list and sublists. Here 
/' represents some type-value code. A list or sublist is usually denoted 
by its first address; thus, for our example, list 1 has two sublists, 2 and 8 
where sublist 2 itself has a sublist 3 and sublist 8 has sublist 9, which in 
(urn has sublist 10, 
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Push-down Ltsts. The push-down list can be conceived ‘a a non 
branching threaded list made up by placing each a element ony 
of the list in the same memory location, each time ‘pushing down the 
other elements; when an e!ement is removed from the list, it is always 


Address 

















i fa li i Y . The three parallel lines represent 
Fic. 5-8. Illustration of a list and its structure. — ‘atk ° 
sie list, the two parallel lines its sublists, the single lines the sublists of the sublists, ete, 


L 
Address|elements Address|elements 





w Hl " 
(a) "pushing down" (b) "popping up 


Fig. 5-9. Illustration of push-down-list processes. 


taken off the top by removing it from this same memory location and th 
remaining elements are then made to “pop up” (see Fig. 5-9). Of cour 
it is not desirable to move the contents of words literally from one mem 
ory location to another. Rather it is more efficient effectively to chan 
the addresses of the memory locations by means of iu list index regis 
that records the address of the next word on the list. Then pushi 
down and popping up is analogous to moving backward and er 
along the list chain, where the address of the index register becomes 

effective address of the top element on the list, ‘To pop up, the addre 
of the next word in the list is inserted in the list index register; to pu 
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down, the address of the new list word is inserted into the register. New 
words for the list must of course be appropriately formed as described 
above. 

An example of the use of a push-down list is for the unused memory 
locations in a threaded-list organized memory. When required, unused 
memory locations can be obtained from this list; vacated memory loca- 
tions can be put on the top of the list. Also subroutines that use them- 
selves as subroutines can utilize push-down lists to record the independent 
parameters at any level of use (why?). Threaded lists and push-down 
lists are closely associated with each other. For example, in tracing 
through a threaded list, the stage of the sublist trace can be recorded in a 
push-down list of first words of successively nested sublists. 


5-6. Memory Space, Speed of Computation, and Automatic Programming 


Sharing of the High-speed Memory. As we have seen above, a com- 
puter can have a high-speed memory (often magnetic cores), a medium- 
speed memory (magnetic drums), and finally, a slow-speed memory 
(magnetic tapes). As one might guess from our previous discussions, 
(he high-speed memory may often be filled when an interpretive routine 
ix stored in addition to a program. Hence careful attention must be 
paid to the location of the interpretive routine so that it will not inter- 
fere with memory space required for the program. It may often happen 
(hat the interpretive routine has to be stored on the drum. This may 
slow up the rate of computation because, as will be recalled, in order to 
wwe a word from the drum one must wait for the drum to turn to the 
proper address before a word can be read out. 

This problem may be approached as follows: At the time a particular 
sibroutine of the interpretive routine is to be used, this subroutine is 
\ransferred out of the drum into the high-speed memory, where the 
computations take place. In this way subroutines of the interpretive 
routines will be performed in the high-speed memory even though they 
are usually stored in the drum. However, this means that subroutines 
ust be continually transferred out of the drum and into the high-speed 
memory and back into the drum, which takes precious time. Also, 
we have added a new requirement for the interpretive routine to fulfill. 
Now it must not only interpret the simulated instructions but must also 
fransfer from the drum to the high-speed memory the subroutines of the 
inlerpretive routine as they are needed. 

Complications. This problem becomes quite complicated when one 
in desirous of putting some flexibility into the procedure; for, in general, 
when transferring a subroutine from the drum to the high-speed memory 
i! in best to be able to put the subroutine in that part of the high-speed 
memory which may not be in use at the moment, Hence it must be 
determined where in the high-speed memory to transfer the subroutine. 





a 
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This means that a subroutine may be located in different places hes: 
used at different times. Obviously the assignment of cance, Ce) . 
depends upon the exact location of the instructions being a den , 
Hence the interpretive routine must be able to translate this su ie me 
and fix up the addresses as required, depending upon the location oO 7 
subroutine. We shall discuss this problem in the next section in m 
ae complexity of an interpretive routine can be increased still ais 
Observe, for example, that a function subroutine may be ie nee e ‘ 
in the course of computation and hence it might be desired to ala i 
in the high-speed memory after it has been transferred there. : er 
subroutines that are transferred to the high-speed memory ee 
to be used only once in that program and hence after their use nee - 
take up room in the high-speed memory. Therefore a distinction ae 
be made as to subroutines that should be transferred fempoan 
Further, during the computation for a problem, one ovat ae wi 
another subroutine. Suppose that both these subroutines ave a rea y 
been transferred to the high-speed memory. Somehow the ocala a 
routine should remember not only that these subroutines have ie : 
been transferred to the high-speed memory so that the transfer . a 
have to be repeated, but also where they have been put. a 2 u onl 
to these problems are too complex to be treated in this textboo ; 
purpose has been to give the student an appreciation of the complexi 
that can develop in planning and writing interpretive routines. a 
Advantages of the Compiler Routine. Until now we have been Nas 7 
ing the use of subroutines as the interpretation of simulation instruc ie 
during the running or computation of a program. An coated be a 
is initially to replace the simulation instructions with t Pigeon, " 
routines before the program is run. ‘That is, the program is Wrl 7 
terms of simulation instructions, but as the program is read woes _ 
computer, these simulation instructions are pupae scenes i 7 
the proper subroutines—the subroutines then becoming an in “A oo 
of the program. Of course, just as the interpretive routine ha 


‘ : : : . 
the subroutines in our above discussions, so In this new case an auto 


matic routine called a compiler routine performs the task . ni 
inserting the proper routines into the program. Examples 0 onan 
routines will be described in the next tees in this section we 

i such routines are used. ; 
a of a compiler routine over the intgepers as ee 
is that, since in the compiler routine the subroutines are yee y, —_ 
gamated into the program before the problem is run, aay ste vs 
transferring subroutines from drum to high-speed memory is ope 
Hence the problem of time lost in transferring subroutines nee “ae 
tation no longer appears, because the subroutines will already have 
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made part of the main program. Another important advantage of the 
compiler routine over the interpreter routine is that, since the compiler 
routine is only used initially, it need not be stored in the computer when 
the compiled program is running. The interpreter routine must always 
remain in the computer memory since it operates during the running 
of the program. Hence, when a compiled routine is run, more space is 
available for the main program. 

On the other hand, as can immediately be seen, after all the required 
subroutines have been incorporated into the main program, this program 
can be exceedingly long. Some of the space advantage of a compiler 
routine is thereby lost. The usual method for handling such long pro- 
grams is to divide them up into sections, each of which can fit entirely 
into the high-speed memory. Then one section at a time is put into the 
high-speed memory and computed. Meanwhile the other sections are 
stored in the drum and on the magnetic tapes. Of course a relatively 
great amount of time is spent in changing sections from drum and mag- 
netic tape to high-speed memory, and hence some of the timesaving 
advantages of the compiler routine are lost. The relative advantages 
of a compiler routine over an interpretive routine, if any, depend on the 
particular characteristics of the routines in question as well as on those 
of the program to which the routines may be applied. Further, compiler 
routines can become very complicated, for the compiler routine must 
break the program into the proper sections and assign the sections to 
parts of the drum or magnetic tapes. It must also form instructions 
additional to the main routine which will change sections at the proper 
time. Another problem arises when a subroutine appears in a program 
many times. For example, suppose that the program uses the sine 
routine often. It would be wasting space to compile the sin x routine 
ouch time it is required, having duplicates of this sin 2 routine incorpo- 
rated into the program. It would be much wiser to have only one sine 
routine in the memory and to use it in the manner of a usual subroutine. 
If the program is compiled as it is read in, the compiling routine must 
wense that the sin x subroutine will be used many times and so must 
urrange to have it as a subroutine. Naturally, if we have too many 
subroutines, so many that we have to put some in the drum, then we are 
in the dilemma described for the interpretive routine. 

In the previous sections we have discussed interpretive routines that 
inable a program to utilize subroutines while it is being computed. We 
have seen that owing to lack of high-speed-memory space this can be a 
slow process, The alternative, namely, that of compiling a program 
i it is read in, is considered in the following section. We must always 
keep in mind that the major object in using either interpretive or compil- 


jig routines is to reduce the amount of coding that the programmer 
must do, 
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5-7. Translator and Address-assigner Compiling Routines 


Kinds of Compiler Routines. The purpose of a compiling routine 
is to reduce the amount of necessary coding by allowing the programmer 
to write simulated-language instructions which are interpreted by the 
compiling routine. The difference between interpretive and compiling 
routines is that a compiling routine can be erased from the high-speed 
memory after it has completed its task. Evidently a compiling routine 
can compile mathematical subroutines as well as interpret and compile 
simulation subroutines of the kind described in Sees. 5-3 to 5-5. How- 
ever, there are other uses of compiling routines that lend themselves more 
to the technique, and this section and the one following are devoted 
to these. We shall consider four such uses: (1) as a translator, (2) as an 
address assigner, (3) as a decoder of algebraic symbols, and (4) as a decoder 
of recursion, or loop, symbols. 

Translator. Suppose that we are considering a decimal-coded binary 
computer for which we have written a symbolic code. Ordinarily specifie 
addresses would have to be assigned before the symbolic code were put 
into the computer. However, it is possible to construct a compiling 
routine that enables the symbolic code itself to be read into the computer, 
the addresses being assigned automatically by the compiler. Such a 
compiler is called a translator, since it translates from symbolic language 
into computer language. In these discussions the student should keep 
in mind that two codes are under consideration: the compiler routine 
itself and some program upon which the compiler operates. 

To be specific, let us consider a one-address decimal-coded binary 
instruction format having two characters for the operation and three for 
the address. Suppose, for simplicity in illustration, that the program 
is to add two numbers, 8,818 and 7,080. The symbolic code for this 
might be 


Instruction 


Address | Remarks 
Opera- 


tion 











CLR MT ZRO_ | Clear accumulator 

AIC AD 18ST | Add 1st number to accumulator 
A2C AD OND | Add 2d number to accumulator 
TRR TR SUM | Transfer result into sum temporary 
18ST 08 818 ist number 

2ND 07 080 2d number 

ZRO 00 000 0 constant 

SUM oe Result: sum temporary 
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If the compiler routine is to assign addresses to these instructions and 
numbers, then—in addition to the instructions themselves, in symbolic 
form—the symbolic address corresponding to each instruction, constant, 
or temporary must also be read into the computer. We might make the 
convention that the symbolic address will be read in immediately pre- 
ceding its associated symbolic contents. For example, for our code we 
would initially read the following into the computer: 


CLR 
MT ZRO 
AIC 
AD 18ST 
Instructions 
A2C 
AD 2ND 
TRR 
TR SUM 
1ST 
08 818 
2ND 
07 080 
Constants and temporaries 
ZRO 
00 000 
SUM 
00 000 


As yet there is no way for the compiler to distinguish between instruc- 
(ions and constants or temporaries. If the convention were made that 
only instructions will appear first and that all the constants and tempo- 
ruries are to be put at the end, and if, in addition, the number of instruc- 
lions in the program were to be given to the compiler, then the compiler 
would be able to distinguish the instructions from the constants and 
lomporaries. Of course the desired first address of the program must also 
he given to the compiler. Hence one word would precede the symbolic 
vode, with the convention that the number of instructions appears in 
(he operation positions and the first address appears in the a@ positions. 
lor our example this first word would be 


04 600 


if we desired the code to begin at address 600. Of course the symbolic 
vode with its first word must be read into a standard position in the 
vomputer memory so that the compiler routine will know where to work. 
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Suppose that the convention is made that the symbolic code is to be read 
into the memory starting with the first word in address 001. 

Writing the Translator Compiler Routine. It now remains to discuss 
how to form a compiler routine that will assign specific addresses to a 
code when all the above-described conventions are adhered to. A general 
flow diagram for such a compiler appears in Fig. 5-10. 

Consider box 1 of Fig. 5-10: First the real addresses would be assigned 
to the symbolic addresses of the instructions. By our conventions it is 
known that these symbolic addresses 
are found in every other word of the 
Make symbolic-address assignment table symbolic code, starting with the 
second word. The first symbolic 
address is assigned to the address 
Put the words of the code into the proper found in the a position of the first 
word of the code. Next the assign- 
ment of real addresses for the con- 
stants and temporaries is made. 
Replace symbolic operation code with This is done by looking at the con- 

real operation code A ; 3 
tents of each instruction: if the 
symbolic address (in its a position) 
is not that of another instruction, 
then it must be that of a constant 
or temporary. Such addresses are then assigned in sequence. (Note 
that we cannot have the compiler assign addresses to the constants 
and temporaries in the same manner as it did for the instructions since we 
have not informed the compiler of how many constants and temporaries 
there are.) A detailed flow diagram of box 1 appears in Fig. 5-11. 

Now that a table has been computed assigning real addresses for the 
symbolic addresses, the instructions and constants can be transferred 
into these locations, with the « positions of the instructions changed from 
the symbolic to the real address. Our convention of preceding each 
instruction and constant by its symbolic address again plays an impor- 
tant role. When the code is compiled into its real position, the instruc 
tions and constants will appear in the usual consecutive sequence. 

Finally, we must replace the symbolic operation code with the real 
operation code. For this the compiler routine must be given a table 
relating these operation codes. 

Several further details must be observed. There are some instruc 
tions in which no address appears in the a positions, e.g., shift instruc: 
tions where the number of positions to be shifted appears instead. Care 
must be taken that this is not interpreted as a symbolic address to be 
assigned a real corresponding address. Hence the instruction itself must 
be examined by the compiler to ensure that it is not one of these exce 


address and replace each symbolic address 
with its actual address 





Fra. 5-10. General flow chart of a com- 
piler routine. 
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Fie ies Also, some constants may involve a symbolic address 
‘are must be taken to recognize these and replace th i 

em 
real address. : cee 
_Summarizing, we can draw the following conclusions from our example: 
Virst, the writing of a compiling routine can be ‘“‘tricky”’ if we are to be 







Go to next 
instruction 


Assign appropriate address to 

symbolic instruction address 

Have all instructions been 
assigned addresses? 





Yes 











Is the symbolic address in 
the @ position of an instruction 
a symbolic address of another 
instruction? 


Go to next 
instruction 


Fia, 5-11. Detailed flow chart of box 1 of Fig. 5-10. 


wertain that all possibilities are taken care of. Second, in order to use a 
vompiler routine, the programmer must adhere éatefully: to all conven- 
tions required by the routine. 

Address Assigner. Suppose that it is desired to write many sub- 
routines and it is not known beforehand where they are to be located 
in the memory. One procedure might be to write each subroutine as if 
it were to be located with its first instruction in some standard address, 


eee 


e 
2 . 
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say 003, and then later to use a compiler to assign new addresses to each 
subroutine when the final decision is made on where it shall be located. 
(Recall that a similar requirement arose in connection with the trans- 
ferring of subroutines from the drum to various locations in the high- 
speed memory.) To perform this task, the compiler routine merely has 
to add to each address appearing in the code the difference between the 
desired address of the first instruction and 003. The only problem is to 
determine which are the addresses. For some instructions such as shift 
instructions do not contain addresses in all the address positions, and 
these positions should not be changed. On the other hand some con- 
stants may contain addresses, and these must be recognized and changed. 
One procedure is to require the programmer to list all the instructions 
first, then any constants that contain an address, finally any other 
constants. In addition, the programmer is required to insert words in 
front of the program that list (1) the desired address of the first instruc- 
tion, (2) the total number of words in the code, (3) the total number of 
instructions in the code, and (4) the total number of words other than 
instructions that contain addresses. Suppose that we were considering 
a two-address system; we could then put these numbers, respectively, 
into the a and B positions of the word in 001 and the and 6 positions 
of the word in 002. 

The compiler routine will first determine which instructions contain 
addresses and will add to these addresses appropriately. Next the 
program will look in the a and 8 positions of the constants that have 
addresses—recognizing these addresses as numbers between 0038 and 
003 + the total number of words in the code—and will add to these 
addresses appropriately. Finally the compiler routine will transfer the 
code from the standard position to the required position. 


5-8. Algebraic Compiling Routines 


Decoder of Algebraic Symbols. The purpose of this compiling routine 
is to enable the programmer merely to type into the computer an alge= 
braic formula, the compiling routine automatically writing the detailed 
machine-language code. Of course the programmer must also tell the 
compiling routine in what addresses the values of the independent varie 
ables will be found and into which address the result is to be placed, 


Let us first consider a function involving only the operations addition and 


multiplication, such as 
((((a + b) +e) + 4) 2) 
In order to avoid ambiguity, certain conventions must be observed, 


dot is to be used for multiplication; i.e., We must write ab, not ab, 
Also the parentheses must be placed wherever necessary and never lef 
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understood. For example, a + b-c is never written, but (a + (b- ¢)) 
instead. There is to be a parenthesis at the pacisnina and at the end of 
the function. Also the parentheses are to indicate binary operations 
only—i.e., operations involving two operands and no more. Thus one 
would never write a + b + ¢, but ((a + b) +c) instead. Actually, as 
we shall see in the following chapter, such rather artificial fanencions 
ure really not necessary, and more familiar conventions can be used 
equally well. However, for pedogogical purposes, the use of parentheses 
us described will enable us to concentrate fuller attention on the basic con- 
ee] 4 s and problems involved in constructing algebraic compiling routines 

| I'he parentheses play an important role in such an algebraic foxmula-— 
for they tell in what order operations are to be performed. For example 

in 


((((a + b) -¢) + d) -e) 


the order in which the operations must be performed is a + b = w; 
wie=a;at+d=y;y-e =z, where z is the final result. A study of 
bhe proper use of parentheses shows that they are never ambiguous. 
I) see which operations should be done first, count the parentheses aay 
from left to right. Start with the first parenthesis, and let Sach auc: 
Hoxsive left parenthesis increase the count by one inore than the previous 
(left or right) parenthesis and each successive right parenthesis dearense 
(he count by one less than the previous parenthesis. 

Consider, for example, the following: 


Pt Qs erm 


Notice that the right parenthesis of a pair is counted one less than its 
left-hand counterpart in the count and that the last right-hand carenthe: 
ain takes a count of 0, which was of course the purpose for the above 
jumbering rules. Also there is one pair of parentheses for each oper- 
‘lion + or + to be performed. 

Now we ean gain some insight into how a compiling routine might work 
io code an algebraic function automatically. To evaluate the functions 
we start counting the parentheses according to the above rule in a left- 
\o-right sean. When the count decreases by 1, this is an indication that 
i) operation is to be performed. The operation is performed, and the 
fount proceeds. The process ends when the count is 0. For ear example 
{b | ¢) is computed first; this set of symbols is then replaced by the 


{Soo 8, Kleene, “Introduction to M i 
. Ki , ) etamathematies,” p. 24, D 
Company, Princeton, N, J., 1952, sin edie. Noreen 
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result of the operation; i.e., let s = 6 + c, whence we now have 


a3) es (e+ f))) 


12 
Next compute (a-s), and let ¢ = a: s, forming G + (d- (e + f)) 
2 1 
Next compute (e + f), and let u = e + f, forming t + a . u)) 
3 2 


Next compute (d- wu), and let v = d- u, forming « + ») 
2 uy 
Finally compute (¢ + v), resulting in the final value w 
1 0 


Writing the Algebraic Compiler. The compiling routine is to wri 
instructions that will evaluate the function in a manner directly anal 
gous to the above illustration. Figure 5-12 is a simplified flow chart 
an algebraic compiler. The algebraic function can be considered as 
list of symbol codes S;, Sx, . . . , Sn which the compiler is to interpre 
The compiler must have available a list of the codes for right and le 












Is S; a left naventhasis?\2) 


Is S;a right parenthesis \4 ) Add 1 to count 


i 





Subtract 1 from count S| 









Form next instruction as 
Op a B Y 


where S; is a new symbol 












Replace 

S;-4 S;-3 Sj-2 Sia S; 

by 
S; 

in the list 


Is the count zero\8) 


Fia. 5-12. Simplified flow chart of an algebraic compiler routine utilizing the 
thesis convention. 














Exit 
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parentheses, arithmetic operations, and letters. As an example utilizing 
u three-address system, it is readily seen that the flow chart of Fig. 5-12 
when applied to our illustration will result in 


ADD c b 8s 
MLT s at 
ADD fe. 
MLT ud v 
ADD vt w 


‘This symbolic code can now have addresses assigned to it, or the address 
ussignment and compiling could have been accomplished all in one step. 

Of course we have neglected many important features of an algebraic 
compiling routine. First observe that the symbols in the final code must 
represent addresses. Hence, if numerical constants appear in the original 
formula, these must first be assigned to memory locations. This can be 
wccomplished in an initial processing of the raw input, or in box 1 of the 
flow chart. We have described algebraic functions utilizing + and - only; 
the flow chart, Fig. 5-12, will work equally well for subtraction, —, and 
(livision, /. However, the flow chart must be extended at box 4 for 
inclusion of exponentiation, subscript calculation, roots, and so forth, in 
Which case the operation must be more fully analyzed. Further analysis 
of the operation type would also be necessary if a more conventional 
hotation than that of the parentheses were used, such as interpreting 
+ beasa+ (b-c). We have illustrated a three-address system; how- 
ver, the flow chart becomes somewhat more complicated for a one- 
wddress system. For example, division is more effciently coded in a 
right-to-left scan (see Exercise 5-22). In addition a real compiler should 
thuke informed decisions between floating- and fixed-point representation 
for cach constant and partial result. The compiler should utilize the 
Hommutative law for addition and multiplication to save on instructions, 
ind so forth.f Note that five groups of memory addresses are involved 
i) this compiling routine: 

1, The locations of the compiling routine itself 

2. The locations of the words punched from the original symbols 

4, The location of the table relating the letters of the function to the 
dresses that will contain the values of these letters as independent 
variables 

|, The location of the values of the independent variables 

. The location of the program written by the compiling routine 

When the compiling routine is writing the function-evaluation pro- 
frum, (1) to (5) are needed; when the resulting program is being run, 
wily (4) and (5) are needed, 

| See, for example, R. W, Floyd, An Algorithm for Coding Efficient Arithmetic 
Hporations, Communs, ACM vol, 4, no, 1, pp. 42-51, January, 1961, 
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Recursive Compiler. Consider the problem of constructing a compiler 
program that will automatically code loops. This problem can become 
very complicated, depending on the generality with which it is desired to 
have the program perform. However, to illustrate the concept, let us 
consider the following oversimplified example: Suppose that we wish to 
construct a compiler that will evaluate a single function by successive 
substitutions—in other words, one that will automatically construct a 


Number the parentheses 
appropriately 


Write the proper instructions for 
performing the operation indicated 







Adjust the original code 
appropriately with this 
operation omitted 


Has the coding been 
completed? 


Exit 


Yes 


Fra, 5-13. Flow chart for the evaluation of a single function. 


code to perform the flow diagram of Fig. 5-13. Here let us assume th 
the function f(x) is given by a subroutine: then the only information t 
compiler needs is the number of the subroutine for f(a), the initial val 
of x, namely 2o, and the accuracy ¢ for which the function is to be eval 
ated. Of course in this oversimplified case we are tacitly assuming t 
f(x) has certain properties (why ?). 

One very simple way of constructing such a compiler is to write a © 
as indicated by the flow diagram of Fig. 5-138. Then all the compi 
need do is interpret the form in which a particular f(a), vo, and € are int 
duced into the computer and put these values into the proper addre 
of the code. Of course conventions must be made for the form of t 
particular f(x), vo, and e, for the exit, and for the final location of ¢ 
compiled code. 

Simple extensions of such a compiler might enable it to compile | 
within loops, where now some notation must be introduced to tell 
compiler which loops are within which loops. We may make the ¢ 
piler more versatile by enabling it to form partial products or part 



















Sue. 5-9] DEVELOPMENT OF AUTOMATIC PROGRAMMING 191 


sums as well as successive iterations. Again perhaps more general func- 
tions than these can be introduced. The whole thing could be integrated 
and amalgamated with the algebraic compiler to act as one large compiler, 
and so forth. 


56-9. Exercises 


5-1. Write out precisely what an automonitor program would print if applied to the 
vode of Table 3-1. 

5-2. Suppose that a code contains a negative jump instruction. How can the 
\itomonitor program handle such an instruction? After the automonitor program 
executes the negative jump, how can it regain control? 

5-3. Write a simple post-mortem debugging program as required for the third 
method of See. 5-2. 

5-4. Write in symbolic code the series of instructions that lead to the setting up of 
the a address of the transfer instruction of subroutine Q in Fig. 5-3. 

6-5. How would the evaluation of a function of two independent variables affect 
(he discussion of Sec. 5-3? 


With reference to the simulational interpretive routine discussed in Sec. 5-4: 

6-6. Write out in detail the subroutine for multiplication, floating unnormalized. 
6-7. Write out a subroutine for divide, floating unnormalized. 

6-8. Write out the codes for boxes 3 to 6 of the flow chart of Fig. 5-4. 


6-9. What other information must the format of the pseudoinstruction “Search” 
vontain in addition to the “type” of the list being searched for? (Hr: How does the 
vomputer know where the first list is?) 

6-10. Write an interpretive code for “Search,” using the flow chart of Fig. 5-5 and a 
iwo-nddress relative indexed instruction format. (Hryr: Use an eatract instruction 
(0 isolate “type’’; use a return jump to direct the interpreter routine exit to the next 
jimtruetion of the main program.) 

6-11. When the proper list has been found by the search pseudoinstruction, what 
yooord is made of its location and how can this record be used? (Hint: Remember the 
jidex register.) 

6-12. What addresses must be specified by a pseudoinstruction that will compare 
two ordered lists and record the common numbers? (Hrnt: Three lists are involved: 
ihe two ordered lists and the final record.) 

6-18. Draw a detailed flow chart of an interpretive routine that will interpret a 
slowe up pseudoinstruction. 

6-14. Draw a detailed flow chart of an interpretive routine that will interpret a 
make space pseudoinstruction. 

6-16. Deal a four-handed poker game. How would you record the results of the 
‘onl as a threaded list in a computer’s memory? 

6-16. Compare the contiguous-component and threaded-component list structures 
w# far as using sublists is concerned. 

6-17. Give an example of a subroutine that uses itself. 
lint be utilized here? 

6-18. Give an example of a subroutine that would use itself, and where this use 
would again entail its own use. How would a push-down list be used here? [Hrnr: 
i) weneral a routine for a recursive (inductively defined) function uses itself as a 
subroutine, | 

6-19, Write the specific code for the flow diagram of Fig, 5-11. 


How would a push-down 
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5-20. Write the code for box 2 of Fig. 5-10. ; ; 

6-21. Draw a detailed flow diagram for the address-assigner compiler routine 
described in Sec. 5-7. 

5-22. Using a one-address machine-language format, show by example that a 
right-to-left scan on an algebraic function results in a more efficient compiled code 
(with fewer instructions) than a left-to-right scan. [Hrnr: Use as an example the 
function ((A - V)/(Y + Z)).] ’ 

5-23. How would the flow chart of Fig. 5-12 be adjusted to account for a convention 
that enabled the omission of parentheses and utilized the rules that - and / supersede 
+ and — [e.g., where (a + (b- c)) is written as (a + be), and (a — (b/c)) is written 
as (a — b/c)]? ' 

5-24. How would the flow chart of Fig. 5-12 be adjusted to account for numerical 
constants in the function? 

5-25. Write a program for the simplified recursive compiler. 
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CHAPTER 6 


ALGEBRAICALLY ORIENTED LANGUAGES: ALGOL 


6-1. Languages 


Introduction. In this chapter we shall present an example of a com- 
plete, self-consistent automatic-programming language. In the previous 
chapter we have studied how the computer can be prepared or precoded 
with various automatic programs to aid the use of a subroutine, to trans- 
lute symbolic codes, to transfer parts of a code, to handle lists auto- 
Muatically, to decode algebraic symbols, and to compile a loop code. 
here were essentially two parts to each of these discussions, the part 
that told how to use the automatic program in writing a code and the 
part that described how the automatic program itself was constructed. 
We have been building up to the idea of a general automatic-programming 
language, in which the procedures for using a combination of such auto- 
tutic programs are unified and standardized, and for which the pro- 
u’ams themselves are made compatible with each other. 

‘Thus in this chapter we consider a new process, that of establishing 
(he unified language itself. With the combinations of methods presented 
ji the previous chapter, it is clear that a language can be devised that is 
independent of particular computer instructions. Hence a code can be 
Written in terms of the automatic language alone, and specific computer 
limtructions, the machine language, need never be mentioned. If auto- 
iulic programs were written for different computers to interpret the same 
uilomatic language, then any code written in this language could be run 
equally well on any of the different computers. 

Aw will be shown, establishing a complete and self-consistent automatic- 
jogramming language is far more complicated than just listing a set of 
jmeudoinstructions. However, before we initiate our discussion several 
words of caution are in order. First, in this text our discussion will be 
Of an informal nature, with such terms as “‘complete,”’ ‘‘self-consistent,”’ 
and even “language” not precisely defined. Although we shall continue 
{® use terms that have particular meanings in the field of linguistics, 
we do not in any way intend to make, or even imply, a true language 
Rather we hope that the normal connotation of such words 


108 
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automatic-programming “anguages.” We shall occasionally use English 
as an analogy, but only as a pedagogic aid—no other connotation is 
meant in such instances. Even our dis cussion of automatic-programming: 
languages will be from an elementary, introductory, descriptive point of 
view. We shall include only those concepts that are considered peda 
gogically necessary for the purposes at hand. Our discussions will b 
oriented directly toward the understanding of the methods involved 
automatic programming, and no rigor will be intended. The astut 
reader, however, may at times sense the more basic ideas that are bein, 
touched upon. For a deeper insight into the basis of languages, of bot 
the human and automatic-computer types, other, more detailed studi 
should be sought. 

With these qualifications in mind, it is appropriate for our purposes 
consider four ingredients of a language, { namely, characters, words, expre 
sions, and statements. These are really the hierarchical ingredien 
because words are made from combinations of juxtaposed characte 
expressions are composed of groups of words, and statements consist 
specified combinations of expressions, together with certain words. 1 
description of an automatic-programming language involves the spe¢ 
cations of these four ingredients: namely, (1) a display of the allowa 
characters, (2) rules about how to form words properly, GB) descriptie 
of the meaningful expressions of the language, and (4) discussions ab 
the kinds of statements of which the language is made, The analogy 
these structures to English is clear. 

Tn an exposition of an automatic-programming language, the dese 
tions themselves can be classified into the following types: the syntaet 
definitions, the semantic discussions, and the grammatical forms. 
syntactical definitions make explicit precisely what structures are to | 
meaningful in the language; these definitions are concerned with 
proper construction of words, expressions, and statements. The semaii' 
discussions are concerned with the meanings to be attributed to the v 
ous structures. The grammatical forms are concerned with the pro 
relationships among the different constructs, particularly the stateme 

Characters, Words, Expressions, and Statements. The characters 
language are the basic, indivisible configurations in terms of which 
language is written. For example, for English the character list inel 
the 26 upper-case (capital) letters, the 26 lower-case (small) letters, 
10 digits, the punctuation characters comma, semicolon, colon, peri 
space (i.e., blank), quotation marks, parentheses, and dash, and fi 


+ See, for example, N. Chomsky, ‘Syntactic Structures,’ Mouton & Co,, 


Hague, Netherlands, 1957. ; 
t Henceforth we shall use “language’’ to mean “qutomatic-programming lan 


unless otherwise specifically stated, 










































Suc. 6-1] ALGEBRAICALLY ORIENTED LANGUAGES: ALGOL 195 


the algebraic characters for the operations add, subtract, multiply, and 
divide and for the relations equals, greater than, and implies. (In actual 
practice this list of 77 characters used in English can be made much 
longer—see Exercise 6-1.) Similarly, for an automatic-programming 
language the characters that can be utilized must be listed and agreed 
\pon—no other characters than those on the character list can be used 
or interpreted! The English analogy is typing: no other characters than 
those on the keyboard can be used when typing on a particular type- 
writer, Since codes must be typed or punched with particular binary 
patterns for input to the computer, the necessary limitation on the char- 
acters to be used in a language is easy to understand. 

The choice of the number of different characters to be utilized in an 
wwutomatic language is, of course, up to the individual who is constructing 
the language. With human written languages as an analogy, an impor- 
(ant observation can be made concerning this choice. There are broadly 
three types of written languages: those in which each word is associated 
with a single character, called a pictograph, like (early) Chinese; those in 
which each character represents a syllable, and in which each word is 
tiade up of the characters representing the syllables, like Devanagari;{ 
nd those in which the charaeters form an alphabet and combinations of 
lo(ters are used to make up the syllables and words. Over 100,000 char- 
acters are required for a pictographic-character language (see Fig. 6-1); 
wveral hundred are required for a syllabic-character language ; relatively 
few are required for an alphabetic-character language (e.g., the basic 26 
for Wnglish and 24 for Korean—see Fig. 6-1). The lesson to be learned is 
that the efficiency of an alphabetic-character representation goes up as 
the number of ideas to be expressed by the language goes up; similarly, 
\\ is easy to see that the efficiency of a pictographic language can be high 
when the number of ideas to be expressed is small. Of course, mixtures 
of ulphabetic-character and word-character languages are ordinarily used. 
Vor example, the algebraic symbology used with English in mathematics 
loxts is really a pictographic-character language, where for example zx 
stands for the unknown, or 7 stands for seven. Even in mathematics, 
the difficulties of the word-character languages are sometimes felt, as we 
inust frequently extend our list of symbols to include the Greek, German, 
Hebrew, ete., characters also. For automatic-programming languages, 
the list of available mathematical characters is usually limited to the 
loiters, and hence it is common to use letter combinations for independent 

wud dependent variables, such as FUNCT or VAR. 
The words of a language are structures formed directly from the char- 


wetors of the language and are usually the smallest units that have mean- 
nu by themselves, 


lor example, in English a word (not necessarily an 
{ Devanagari is the basic written Indian language, 
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intelligible or “well-formed” word) is a combination of juxtaposed (side- 
hy-side) letters followed by a space or punctuation character. In this 
sume sense a number, which is a combination of juxtaposed digits, is also 
u “word.” Although the idea of the juxtaposition of characters seems 
\rivial to us, some languages, such as Korean, have their letters placed in 
irrays to form words (see Fig. 6-1). Of course in mathematics an array 
of numbers, to form, say, a matrix, is commonplace! However, because 
‘omputer print-out mechanisms are usually designed to produce rows of 
tharacters, all automatic-programming “words” are composed of juxta- 
posed letters or digits. In such languages the “words” are frequently 
ulled cdentifiers, variables, labels, or numbers. 

lxpressions are the phrases or clauses of automatic-programming 
languages and are composed of words. In such languages, expressions 
lure usually associated with some value. For example, “a + bx’ would be 
(arithmetic expression, because if the values of a, 6, and x were given, 
(he value of a + bx could be determined. Similarly ‘““n or (q and not r)” 
would be a Boolean expression, because if the truth values of Pp, gq, and r 
ve given, the value of the expression could be calculated. Finally “if 
i > b then EILEEN, otherwise ERIC” would be a designational expres- 
sion, because it has the value EILEEN if a > 6 and the value of ERIC if 
“ — b. Thus expressions are usually identified as such because they con- 
“int of a group of words that have a central or unifying meaning, such as 


“value.” This is analogous to the identification of qualifying phrases 
or clauses in English. 


































yaiaa 
a uae 
6 


maRaukwaafa aanraroragaal 


A statement of an automatic-programming language is the smallest 
lructure that represents a complete thought—or a complete descrip- 
lion of a computation to be performed—and is directly analogous to a 
wntence of English or to an instruction of a machine language. Of course 
lor un involved computation, many statements will be necessary, like a 
uragraph of English or a code of a machine language. However, just 
i» there are a limited number of operations that instructions can per- 
form for a particular computer, in an automatic-programming language 
there are a limited number of types of statements in the language. 

I summary, then, our analogy with English has displayed a hierarchy 
i structures, in which the statements (sentences) are composed of expres- 
sions (phrases, clauses), the expressions are composed of identifiers, vari- 
(bles, labels, and numbers (words), and these in turn are composed of 





Fic. 6-1. Types of written languages. (a) Selected pictographs of the Chinese , 
uage Top row: left, “man’’; right, “woman.” Second row: left, rice fiel 
echt “strong.” Note that ‘“strong’’ with a rice field on top is a “man. Th 

d 


ow: left, “peace”; right, ‘‘good.” Note that “peace” is @ woman Sila tharacters (letters, digits). To give precise definitions of meaningful or 
: , , . . F aie oe «t | 
similarly, “good” is a woman with a child. Fourth row: left, “tree”; right, a *woll-formed”’ words, phrases, and sentences is difficult to do for English, 
Fifth OW: left, “up”; right, ‘down.’’ (6) Example of syllabary characters of Dev 
2 y 


gari, the basic written language of many Indian people. The Be Sy me 
the ‘characters above and some words on the line below. (c) The es . p 

The first two rows represent the consonants; the next two rows erree i! Vv 
Words are formed by placing letters in a box. We have illustrated on Me , 
word for “hen” at the lower right; the numbers indicate the order in which the le 

are pronounced, 


fm wny human written language. This task for automatic-programming 
liiguages is, of course, much easier to do, and involves the concepts of 
syntax, semantics, and grammar that we shall now discuss. 

The Syntactical Definitions, The syntax of a language consists of defi- 
jitions of the structures to be formed from the characters of the language, 





eee 
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The concept of the recursive definition is frequently used in such syn- 
tactical descriptions of languages. Consider as an example a recursive 
definition of anumber: The idea is to build up a number from the digits 
0, 1, 2, 3, 4, 5, 6, 7, 8, and 9, and from +, —, and the decimal point, 


We begin by defining 


(a) a digit or (b) an unsigned integ 
followed by a digit 

a decimal point followed by an w 
signed integer 

(a) an unsigned integer or (b) a de 
mal fraction or (c) an unsigned v 
ger followed by a decimal fraction 


1. An unsigned integer as 
2. A decimal fraction as 


3. A decimal number as 


Finally we have the definition of 


(a) decimal number or (b) + deci 
number or (c) — decimal number 


4. A number as 


To see how this series of definitions works, let us build up the num 
— 876.123 in this fashion. We would start with the 


Unsigned integer 8 by applying definition la 


From this we would successively form 


by applying definition 1b 


Unsigned integer 87 iti 
by applying definition 1b 


Unsigned integer 876 


Similarly we would form the unsigned integer 123 by successive ap 
cations of definition 1. Then we can form the 


Decimal fraction .123 by applying definition 2 


Next we form the 


Decimal number 876.123 by applying definition 3e 


Finally we can state that we have the 
Number — 876.123 


Of course, given a collection of digits, the definitions can be utilized 
repeated application (in reverse) to determine what the collection 
For example, 256 is both a number (by definition 4a) and a decimal n 
ber (by definition 3a) and an unsigned integer (by definitions la and 1 

The problem of describing a language involves a peculiarly uni 
difficulty. Distinction must be made during the discussion between 
language being described and the language being used in making 


by applying definition 4c 
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description. If the language being described is called simply “the lan- 
yuage,” then the language in terms of which the description is being made 
ix called “the metalanguage.” Clearly the metalanguage and the language 
must be distinguishable from each other, or else much ambiguity and 
imprecision would result. In fact, many comedians utilize such ambi- 
uity as the basis of their comic techniques: as in the telephone joke, 
where the operator requests the spelling of the name ‘‘Jones” and on 
receiving a reply starting “J like in Jack,” then asks, “How do you spell 
Jack?” 

l’'requently a meta-metalanguage is required, for we consider the 
weneral names of the types of language structures to be in the meta- 
language, and hence discussions about these names must be in the meta- 
iietalanguage. We shall make the distinction between the language and 
inetalanguage by utilizing boldface characters for the language. We 
shall not generally make any distinction between the metalanguage and 
the meta-metalanguage but will leave the proper meaning and interpre- 
(ution to be obtained from the context. 

syntactical definitions can frequently be greatly shortened by uti- 
living a special meta-metalanguage symbolism. For example, let the 
symbol ::= represent “is defined as,” the symbol | mean “or,” and 
( ) indicate “the structure of the language.” Thus, for instance, 
definition 1 given above for an unsigned integer becomes 





|, (unsigned integer) ::= (digit)|(unsigned integer)(digit) 


Where juxtaposition of (- + -)(; - -) means “followed by.” In this nota- 
(ion the rest of the definitions become 

¥ (decimal fraction) ::= 
', (decimal number) :: 


(unsigned integer) 

(unsigned integer)|(decimal fraction) 
\(unsigned integer)(decimal fraction) 
(decimal number) 

|-++(decimal number) 

|— (decimal number) 


ll 


1, (vumber) :: = 


Note that if we use an actual symbol of the language itself, such as ., +, 
or —, then we do not need the brackets ( ), which are used only to 
denote generic names of the symbols of the language. 

The Semantic Discussion. By semantics is meant the discussions of 
ihe meaning of the structures that were syntactically defined. Thus the 
semantic discussion associated with “number” would consider the cardi- 
jw! and ordinal meanings of numbers and their uses. Four types of dis- 
(\iesions are generally involved: discussions of the general role and purpose 
fov the structure; of the kinds, or classification, of structures of that type; 
af the relation of the strueture’s function to its syntactical definitions; 
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and of the relation of the structure to its use in the higher grammatical 
forms in which it plays a part. 

For example, if we were discussing the semantics of English words 
we would first describe them as the indivisible basic parts of Englis 
that have content (i.e., can communicate ideas) in themselves. Thus 
word is the smallest unit of English that has meaning when taken b 
itself. Second, we would describe the kinds of words, such as nou 
verbs, adjectives, adverbs, prepositions, etc. Third, we would discu 
say, the various endings that words can have from a syntactical point 
view, with reference to the meaning of the words, such as that the ad 
tion of an s to a noun that denotes an object will make the noun plur 
Finally, we would discuss, say, the fact that nouns have particular pla 
in sentences, as subjects, objects, ete. Of course our English illustrati 
is manifestly naive if it is taken to apply literally to the English langua 
but we do feel that we have illustrated, if only in a superficial way, t 
role the semantic discussion plays in the total formulation and descriptii 
of an automatic-programming language. 

The Grammatical Forms. Consideration of grammatical forms appli 
mainly to the statements of the automatic-programming language (or t 
sentences of English) and includes discussions of both the general kin 
of statement forms and the relationships of statements, one with anoth 
Specific discussions of these must be relegated to the sections on t 
specific automatic-programming languages themselves; in this secti 
we shall discuss only the general considerations of grammatical for 
Note, however, that by ‘grammatical forms” we really mean the syn 
and semantics of combinations of statements in relation to each oth 
The automatic-programming statement, like the English sentence, is t 
smallest structure by which a complete thought can be expressed; het 
consideration of the types of meanings, and use of such statements 

relation to other statements, is of foremost importance. 
Automatic-programming languages are of course all designed for 
specific purpose, to enable the programmer more easily to tell the eo 
puter what to do. Hence analogy to all of the English language is 
really valid; rather the analogy should be made to that part or type 
English utilized in cookbooks, or other “what to do” (or, if you W 
“how to do it’) manuals. Hence in our explanatory analogy of gra 
matical forms, we shall limit ourselves to forms found in such manu 
In fact, to describe the kinds of grammatical forms (and also to add 
little flavor to the discussion) we shall utilize the analogy of an act 
cookbook recipe for making chocolate fudge, which follows: 


+ Adapted from “The Good Housekeeping Cook Book,” p. 752, Rinchart & 
New York, 1949. 


that the procedure declarations are composed of statements. 
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2 cups granulated sugar 

1 cup bottled milk 

1g teaspoon salt 

2 oz unsweetened chocolate 


2 tablespoons white corn syrup 
2 tablespoons butter 
14 teaspoon vanilla flavoring 


(NotE: Cool means to let stand at room temperature until no longer warm to the 
touch, Stir means to blend ingredients with a circular motion of the spoon, widening 
(he cireles until all ingredients are well blended.) 


Combine first 5 ingredients in saucepan over low heat. Stir constantly until sugar 
(lissolves. Then, stirring occasionally, cook gently until a little mixture, dropped in 
vold water, forms a soft ball. Remove from heat; drop in butter. Let stand without 


al irring until you can just rest hand on bottom of pan. Add vanilla; beat with spoon 
until mixture loses gloss and small amount, dropped from spoon, holds its shape. 
lurn into a greased 9- by 5- by 3-in. pan. Cool; cut in squares. 


Observe that the recipe is naturally divided into two parts: the first, 
i declaration of what will be needed in the action; and the second, the list 
ol procedures to be executed, in order, in performing the task. Analo- 
ously, automatic-programming languages contain two types of gram- 
iintical forms, called declarations and procedures. The declarations are 
ul two types: the data declarations, or records of the items to be utilized 
iliring the execution of the procedures (e.g., the list of ingredients), and 
(he procedure declarations, or subroutines (e.g., the general processes of 
Mooling and stirring, as described in the recipe) that are referred to in the 
vlunl procedure statements of the procedures. The procedures to be 
iirectly executed during the execution of the recipe are analogous to the 


jimtructions of an ordinary machine-language code, considered in Part 1 
if this text. The procedure statements describe the operations to be per- 
formed on the ingredients, or data, referring to the subroutines of the 
fleclaration part of the recipe. 


‘There are many types of procedure statements, some of which may 


Wivolve decisions—such as “Stir constantly until’ sugar dissolves. The 
iiuin field for which the language was developed, or its orientations, 
hweomes apparent on examining the actions to be performed by the col- 
levtion of procedure statements that are used in the language. Thus 
‘vombine,” “stir,” “cool,” and “beat” are characteristic of cooking, while 


‘wwnign the value to,” “take the square root of,” and “repeat the follow- 


ii computations until” are characteristic of algebraic computations. 


I lig. 6-2 we have summarized some of the above remarks. Note 


Note also 
il we can quite naturally extend the meaning of ‘expressions’ to 
hide “phrases and clauses’’ as well as words. 
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Structures of a 


Description given 
r cookbook language 


in terms of 


Procedures Declaration 










Grammatical 
forms 


Declaration forms 





Statements 


































Syntax 
definitions Types of statements, Procedure Data declarations 

Semantic including mix, stir, declarations - 
discussions blend, etc., and 


conditional 


Syntax 
definitions 
Semantic 


Character lists 


Fra. 6-2. Summary of structures of a “what to do” or ‘‘cookkook”’ language. N 
that the declaration forms are of two types: (1) a list of ingredients, or data, and 
descriptions or subroutines of processes, which latter are given in terms of statemen 
The statements and the data declarations are composed of expressions, which cam 
formed directly from numbers, words, and measure abbreviations or indirectly thro 
phrases and clauses. Finally the expressions are composed of combinations 


characters. 


6-2. Example of an Algebraically Oriented Language : ALGOL 


Purpose. As we have seen in the previous section, all automat 
programming languages are composed of expressions and stateme 
which are analogous, respectively, to the words and sentences of Engl 
or to generalized addresses and instructions of direct’ machine-instructi 
systems (i.e., machine languages). The form and purpose, respective 
of the allowable expressions and statements are described in terms 
syntax and semantics, and the rules governing the use of statements 
form codes or programs are discussed under the grammatical forms. 
languages can be interpreted in terms of such an organization; the dif 
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ences between languages are found in the precise syntactical, semantic, 
and grammatical entities defined for the different languages, and these 
definitions in turn depend on the purposes for which the different lan- 
yuages were constructed. 

The purpose of the particular language now to be considered is to 
assist in the programming of primarily algebraic numerical computations. 
‘This language is called ALGOL, which stands for algorithmic language, 
and has been generally accepted internationally as a common alge- 
braically oriented language. As noted above, a program written in 
terms of a particular language can be run on any computer for which 
there is an automatic program that can compile or interpret the lan- 
guage. This fact indicates the feasibility of the concept of establishing 
1 universally acceptable international automatic language, analogous to 
the universal written language of music. 

Toward this goal, a series of conferences and meetings were held during 
1958 and 1959, culminating in a meeting in Paris on Jan. 11-16, 1960, 
among representatives from the United States, Denmark, England, 
l‘rance, Germany, Holland, and Switzerland. As a result of the 1960 
meeting, the ‘Report on the Algorithmic Language ALGOL” was pub- 
lished by the Association for Computing Machinery.t In this and the 
following sections we shall present the basic concepts included in ALGOL. 
Wherever pedagogical considerations warrant, we shall utilize simplified 
versions of particular structures of ALGOL. For the complete language, 
the reader is directed to the reference given in the footnote. ALGOL 
was chosen as an example of an algebraically oriented language because 
it includes almost all the basic concepts of other algebraic languages and 
iis use in application as well as in publications is rapidly becoming 
widespread. 

Example of the Language. Since the purpose of ALGOL is to describe 
computational processes, the arithmetic expression, together with the 
so-called assignment statement, is of central importance. The arithmetic 
oxpression is composed of combinations of the arithmetic operations with 
numbers, variables, and functions; the assignment statement, containing 
within it an arithmetic expression, tells the computer to proceed with the 
performance of the computations indicated by that arithmetic expression. 
As an illustration of how the language works, consider the following simple 
example of a code to compute a root of Ax? + Bx + C = Ofor A = +3, 
i = +1.7, and C = —0.31. One of the two roots is given by the well- 
known formula 


—B + VB? — 4AC 
2A 





+ Bee Communs, ACM, vol, 3, no. 5, pp. 209-314, May, 1960. 
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‘The code to evaluate the right-hand side of this equation becomest 


;A:=3;B:=1.7; C:=—0.31; 
root :=(—B-+sqrt(BT2—4 XA xC))/(2XA); stop; 


Type 
declaration 
Designational 
expressions 


The meaning of this jumble of symbols becomes more or less obvious 
with the following explanation: First the symbols between semicolons 
(;.... 3) are statements. This is analogous to putting instructions, 
constants, ete., on the separate lines of a coding sheet, or to placing 
Iwnglish sentences between periods. Statements are normally executed 
in sequence. The first three statements are assignment statements and 
record the desired values of A, B, and C; the fourth statement is also 
in assignment statement and computes the root; the last statement 
slops the routine. In this example, the expressions are A, 3, B, 1.7, C, 
—0.31, root, sqrt(BT2—4xAxC), and stop; here BT2—4xAxC is an 
urithmetic expression, or the whole collection 


(—B+sqrt(BT2—4xAxC))/(2xA) 
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28 g Pe cun be considered an arithmetic expression. The numbers involved are 
o © © . . 
82 Fe 2s §, 1.7, —0.31, 2, and 4; the variables are A, B, root, and C; the function 
au oO <to 


involved is sqrt; and —, T, /, and x are the arithmetic operations. The 
wutomatie program that is to interpret this language compiles the appro- 
priate instructions and, for the square root (sgrt), refers to the appro- 
priate subroutine. 

In addition to the arithmetic expression and the assignment statement, 
iany kinds of nonarithmetic expressions and other statement types must 


Structures of an ALGOL program 
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Function 
designators 















Logical values 











2 ge ; rae ae 

§ 5 §¢2 le defined, to describe alternatives or iterative repetitions of assignment or 
E =o : . : ‘ 

3 2 28 womputing statements. Also, special declarations must be defined (just 
~ “a . . . . 

oa |% = us part of the fudge declaration consisted of special forms of “sentences” 





listing the ingredients). Altogether, a reasonably complicated structure 
in developed. Figure 6-3 summarizes the main types and the hierarchical 
a(ructure of the statements, declarations, and expressions, all of which 
will be considered in this and the next sections. 

(‘haracters. There are seven types of characters included in the 


Variables 


goto 
statement 





25 ALGOL language: namely, (1) letters, (2) digits, (3) logical values, 
Ss (1) arithmetic operation symbols, (5) relational operation symbols, 
<4 () logical operation symbols, and (7) delimiters. To be concrete, we 


oun list the different types of characters, utilizing our metalanguage 











ce = ’ . 

5's _ “ 2 olution given above :{ 

e-8 © = S c i) 

= a E FI Ee 3 Fr E E 3 | Although ALGOL does not have a stop statement, we shall use it from time to time 
as S ES5s 685s for pedagogical reasons, 

as oO an na an 





{ In this chapter we shall also use some Greek letters occasionally, for our con- 


204 venlence. Purther, we shall use italic letters for standard ALGOL forms, 
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a|b|c|dle|f|g|t|i]j|%|1|m|n|o|p|q|z|s|t\u|v|wixiyiz 
\A|B|C|D|E|F|G[H[I|J|K|L|M|N|O|P|Q|R|S|T|U|V|w |x| ¥|Z 
(digit) :: = 011|2|3]4|5|6|7/8|9 


206 
(letter) :: = 


(logical value) :: = T|F 

(arithmetic operator) :: = +\—|x|/|T 
(relational operator) :: = <|S\/=|2|>|¥ 
(logical operator) ::= = \D|V|A|7 


(delimiter) :: = (IE +]-13)1= 

First, let us make an observation about our notation: we did not need to 
use the brackets ( ) on the right-hand sides because the boldface char- 
acters are themselves in the ALGOL language. The “logical values,” 
T for true and F for false, must be distinguished from the letters T and F 
by the context. The arithmetic operator T means “raise to the power’; 
thus A? would be written AT2. The logical operators differ from the sym- 
bols used in Chap. 10 of this text; the correspondence is as in Table 6-1. 


TABLE 6-1. LoGicAL-OPERATOR SYMBOL Novation 





Meaning ALGOL Chapter 10 
Truth-value equivalence..........-++-- = = 
If, then (implies, is included I )i,ct ex es >) > 
Orie hly cicndeaided pele PROM eae Ra oats « Vv + 
BT oo copigg dnd ed deed ose eeae tenes A _ 
NOG Anis dened techs oa Wem Oe Ets TA A 








Expressions. We shall distinguish six types of expressions, namel 
(1) numbers, (2) variables, (3) function designators, (4) Boolean expre 
sions, (5) arithmetic expressions, and (6) designational expressions. Wi 
shall be concerned with the first three in the remainder of this secti@ 
and with the last three in Sec. 6-3. 

If a represents a digit, then a number can be an integer aa ** * 
a decimal fraction .aa * + + a, or an exponent part sotaa:::* aor 
combination of these, signed or unsigned. For example, 6, —471 
+137.06, 2.999112 (= 2.999 x 10%), 10-12 (= 10-2), and 310— 
(= 3 X 10-!) are all numbers. In ALGOL, a number can only act 
the contents of an address, i.e., a value, and not the address itself, i@ 
not a name or identifier. The syntactical definition of a number 
given above as an illustration of the use of the metalanguage. 

Variables are essentially the basic words of ALGOL and are the na 
(analogous to addresses) for numbers, scalar quantities, or logical valu 
as well as for “locations” of statements that can (eventually) be ev 
ated to result in a number or logical value, as used in ordinary or Boo 
algebra. ‘Two kinds of variables are distinguished, the simple vari 
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and the subscripted variable. A simple variable can be a single letter, 
or it can be a combination of several adjacent letters and numbers, but 
the first (leftmost) character must be a letter in order that simple vari- 
ables and numbers can be distinguished. Thus, for example, b, P63, 
Q382T, and ALPHA are all simple variables. In our metalinguistic nota- 
tion we can write 


(simple variable) :: = (letter)|(simple variable)(letter) 
\(simple variable) (digit) 


Note that, by recursive reasoning, according to this syntactical definition 
the simple variable must have a letter as its leftmost character. 

Subseripted variables designate values of quantities that are compo- 
nents of multidimensional arrays, such as the components of a list, a 
vector, or a matrix, etc. What would normally be the subscripts are 
placed in brackets. Thus a vector V; would be written as VECTORIi], 
u matrix M,; as MATRIX{i,j], and so forth. The “subscripts” themselves 
may appear as integral numbers, simple variables, subscripted varia- 
bles, or arithmetic expressions (see below). Note that the subscripts 
are intrinsically integer-valued, no matter in what form they appear. 
The comma distinguishes the different subscripts: for example, 
VAR[6,ALPHA,3xi+k] is a three-subscripted variable (see Exercise 
(-11). Finally we have 


(variable) :: = (simple variable)|(subscripted variable) 


To identify single-valued (logical as well as numerical) functions by the 
usual algebraic functional notation, function designators are used. The 
independent variables are enclosed in parentheses and separated by 
commas, for example, FUNCT(x,y,z). Here FUNCT (that is, the func- 
(ion name) is the same syntactical form as a simple variable and can be 
(hought of as essentially the first address of a subroutine (i.e., procedure 
declaration) that can evaluate the function; for this reason FUNCT is 
culled a procedure identifier. The (x,y,z) is the ordered set of independent 
variables, the numerical (or logical) values of which must be ascertained 
hefore the function is evaluated. The independent variables themselves 


{| For pedogogical reasons we have not utilized the identifier expression defined for 
ALGOL. The syntactical definition is 


(identifier) : : = (etter) |(identifier) (letter) |(identifier)(digit) 


the same as that for a simple variable. In addition, we differ from ALGOL by choos- 
jie to classify numbers under expressions, omitting strings, and neglecting the syn- 
inotieal definitions of sequential operators, declarators, and specificators. Once a grasp 
of ALGOL is gained from the present treatment, these slightly complicating features 
van easily be reviewed by the reader, as no new concepts are involved. See the refer- 
enoe of the footnote on p, 203 for the full ALGOL 60, 
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can be simple variables, subscripted variables, or arithmetic, Boolean, or 
designational expressions (see below). A syntactical definition of func- 
tion designators would bet 


(procedure identifier) :: = (letter)|(procedure identifier)(letter) 
\(procedure identifier)(digit) 
(independent variable) :: = (simple variable)|(subscripted variable) 


|(arithmetic expression)|(Boolean expression) 

\(designational expression) 

Goa seante, :: = (independent variable) 
(independent)? (independent variable) 
/independent-\ 


\ variable list / ) 


(function designator) :: = (procedure identifier) ( 
\(procedure identifier) 


Note that a function designator does not necessarily have an associated. 
list of independent variables. An example might be R, the procedure 
identifier for choosing a random number. Certain function names should 
be reserved for the standard functions, such as abs(N) for the absolute 
value of a variable N, sign(N) for the sign (that is, + or —) of the 
value N, sgrt(N) for the square root of N, sin(@) for the sine of 6, and 
so forth. 


6-3. Boolean, Arithmetic, and Designational Expressions 


Boolean Expressions. A Boolean expression is a rule for computin: 
a logical value, that is, T or F (see Sec. 10-2 for a discussion of definitio 
of the Boolean operations; Table 6-1 gives their correspondence wi 
ALGOL logical operation symbols). Table 6-2 summarizes the trut 
value definitions of the operations. 


TasLe 6-2. TRUTH-VALUE DEFINITIONS OF BOOLEAN OPERATIONS 





a F T F T 
b F F T T 
“7a T F T F 
aAb F F F T 
aVb F T T T 
aDb T F T T 
a=b T F F T 


For example, if a is T, b is F, ¢ is T, and d is F, then 
((aVb)V (bAc)) = ((bV d)>(aAc)) 


+ Simplified from ALGOL 60, 
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on successive evaluation becomes 
(T VF) = (FDT) 
T=T 
T 


The logical variables, in our example a, b, c, and d, could of course repre- 
sent relations whose values are true or false. For instance, a could repre- 
sent (x2y), which if x is 3 and y is 2 would be true, i.e., would have the 
value T. In fact, in this relation x could itself be an arithmetic expres- 
sion (see below), such as uxw--v, as could y (see Exercises 6-12 through 
6-15). On the other hand, the logical variables could themselves repre- 
sent other logical expressions. Such logical expressions, which can 
include relations between arithmetic expressions and also other logical 
expressions, are called simple Boolean expressions. For example, 


(((uxw-+v)2y) A (uxv)<z)) V (a Vb) = (“cD d)) 


is 2 simple Boolean expression. If a convention is made on the rules of 
precedence for the operations, then many of the parentheses can be 
omitted. The precedence rules of ALGOL are as follows: 


I"irst. 
Second, 
Third, 
lourth. 
lifth. 
Sixth. 
Seventh. 


Arithmetic expressions (according to rules to be given below) 
SS ot 


WuU<> | 


With these rules, the above simple Boolean expression can be written as 
uxw+vzZyAuxv<zV (aV b=~IcDd) 


A metalinguistic syntactical definition of a simple Boolean expression 
(hat automatically includes such rules of precedence can be given as 


follows: 
(elation :3= ateania /relational\, /arithmetic\ 
expression/ \ operator / \expression/ 
logical\ F ; 
Koolean te n= ( | 
(Boolean term) Meniced (variable)|(relation) 


| / function \ 


| ((simple Boolean\ 
\designator/ 


\ expression / 


Remark: The “logical value” is T or F; the “variable” must have 
i logical value, i.c,, must be a logical variable; the function denoted by 





210 


the “function designator” must have a logical value, i.e., must be a 
logical function. 
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(neg-term) :: = (Boolean term)|KBoolean term) 
(product-term) :: = (neg-term)|(product-term) A (neg-term) 
(sum-term) :: = (product-term)|(sum-term) V (product-term) 
(implication) ::= (sum-term)|(implication)>(sum-term) 


/simple Boolean\ , , 
\ expression / ** 


An explanation is in order! Note that each line from the “neg-term” on 
introduces another logical operator, in the order of precedence given 
above. ‘Thus, at each level, the operations previously introduced must 
already have been introduced into the expression, i.e., must already have 
been evaluated. But the right-hand side of each line has two parts, the 
first of which repeats the previously defined level; this is to take account 
of simple Boolean expressions that do not involve that logical operation, 
Finally, the basic ‘‘Boolean term” includes as a possibility a “simple 
Boolean expression” in parentheses; this is to account for terms which 
the parentheses indicate are to be computed out of the precedence order, 

As an example, consider the following syntactical analysis, where the 
abbreviations are ae for arithmetic expression, Bt for Boolean term, re for 
relation, nt for neg-term, pt for product-term, st for sum-term, im for 
implication, and sBe for simple Boolean expression : 


/simple Boolean\ _ 


= (implication) AT eeopeasion? o (implication) 


uxwtv 2y A uxv <zV(aVb= 1c 9d) 
LI 1 td orp Lf RL} 
1 ae ‘1 ae 1 1 (Bt Bui: Btt Bt 
fe, » SEL CU a Hn 
' re re ‘ mt nti | nt nti: 
Bt Bt « \ipt ptt + pt pti; 
nt nt ‘st ; st st 
pt ro st - am i 
pt um am 
st sBe 
‘ | 
' sBe 
Bt 
nt 
pt ' 

st 

am 

sBe 


Note that each symbol or combination of symbols must be develo 


















Src. 6-3] ALGEBRAICALLY ORIENTED LANGUAGES: ALGOL 211 


up to the point where it can be combined with other symbols. Also, 
we first had to show that a Vb=~—cDd was a simple Boolean expression 
before it could be incorporated into the entire analysis as a Boolean term 
(see Exercise 6-17). ; 

One further concept associated with a Boolean expression still remains 
\o be introduced. This concept is concerned with the Boolean expression 
whose value itself depends on certain logical conditions and involves the 
so-called 7f clause of the form 


(if clause) :: = if (Boolean expression) then 


The “if clause” is used in connection with a whole expression of the form 


if (Boolean expression) then (simple Boolean expression) 
else (Boolean expression) 


Such an expression is evaluated as follows: If the first Boolean expression 
ix true, then the value of the whole expression is the same as that of the 
simple Boolean expression; otherwise the value of the whole expression 
ix the same as the value of the last Boolean expression. That is, the 
value of the whole expression is the value of the simple Boolean expres- 
sion, or else it is the value of the last Boolean expression, depending on 
the truth of the “if clause.’ With this in mind, we can make this 
definition: 


(Boolean expression) :: = (simple Boolean expression) ; 
\(if clause) else (Boolean expression 


lor example, consider the Boolean expression 
if k<1 then S>W else hSc © 


‘'o evaluate this, we find the value of k, say k = —3, for which k<1 
ix true; then we evaluate the relation S>W to find the value of the 
Boolean expression. Suppose, however, that k = +3: then k<1 is 
false, and we evaluate the relation hc to find the value of the Boolean 
expression. 

ONE FINAL REMARK: -In the definition of the ““Boolean term” given 
above, we should really have had ‘Boolean expression” in the parenthe- 
wos, rather than the more restrictive “simple Boolean expression.” 

Arithmetic Expressions. An arithmetic expression is a rule for com- 
puting a numerical value. We have been building up to this important 
concept, whose full definition involves the ‘‘if clause’? and hence the 
evaluation of a Boolean expression. First, however, we shall consider 
the part of the arithmetic-expression definition that does not involve jf, 
the simple arithmetic expression. This type of expression consists of 
combinations of simple and subscripted variables, functions, and num- 
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bers, with the arithmetic operations of +, —, x, and / representin 
addition, subtraction, multiplication, and division, respectively. A 
upward arrow T represents exponentiation, i.e., raising to a powe 
Thus, for example, 

2T3 means 2° 
and 2T3T4 means (2°)! 
while 2T(3T4) means 2°” 


For p an integer aTp is interpreted as the pth power of a; for p no 
integer but a positive, we quite naturally have 


atp = e(PXin(a)) 


Certain precedences hold that can permit the omission of parenthes 
as follows: 
First. T 


Second. x, / (called mult operators) 
Third. -+, — (called adding operators) 


Unless otherwise indicated by parentheses, the evaluation of a sim 
arithmetic expression is made from left to right according to the p 
cedences. Thus 
+a—b/cx6/(p—q)T2xu/v 
means 
(a — ((((b X (e~')) X 6) X ((p — q)~*)) Xu) X (WD) 

As in the case of the Boolean expression, we can produce a set 
syntactical definitions that take account of the precedence order f 
arithmetic operations: 





. simple 
<term) :: = (unsigned number){(variable) ( ae « oe 
(factor) :: = (term)|(factor)T (term) 

(mult-factor) :: = (factor)|(mult-factor)(mult operator)(factor) 
i simple ‘ 
\eeeametic / ::= (mult-factor)|(adding operator)(mult-factor) 
expression 
/ simple x 
\ arithmetic y (adding operator)(mult-factor) 
expression 


Notice that, in the definition of ‘‘term,”’ an “unsigned number’”’ was ul 
This is done to avoid the occurrence of, for example, a+—b for a-+-(—= 
Then in the definition of ‘simple arithmetic expression’ one of 
alternatives is 

(adding operator)(mult-factor) 


Sec. 6-3] 


which puts a sign in front of the number. 
become a “term,” and not —b alone. 
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In this way only (—b) can 


As an example, let us consider the following syntactical analysis, 
where ¢ stands for term, f for factor, mf for mult-factor, fd for function 


designator, and sae for simple arithmetic expression: 


Bt, SB 18 4x 4,6). 24), 
ie ‘tt it by Ee ds 
: ee ei ie fe ay We 
mf ! f mf ie tmfoo8 
sae | 1 mf ‘| mf rat mf ' 
1 ! l J} : ' 

: ' sae mf tat sae ' 
rece ap Caceres liner 


“~ 


os 
iS 
® 
~s 


Note that we first had to show that BTf2—4xAxC was a simple arith- 
netic expression in order to find that sqrt(BT2—4xAxC) was a func- 
\ion designator. Then the function designator had to be redefined as a 
lerm, a factor, and a mult-factor before it could be used to combine with 
the factor (2XA). Similarly 2xA had to be worked up to a simple arith- 
netic expression before it could in turn take on the parentheses to become 
(he term (2XA) (see also Exercise 6-29). 

The variables and function designators that participate in the arith- 
netic computations naturally must take on numerical (as opposed to 
logical) values. It should be observed that in the evaluation of a simple 
irithmetic expression the actual numerical value of a term is computed 
iwfollows: For the unsigned number, it is the value of the number; for the 
variable, it is the current value most recently assigned in this phase of 
(he computation; and for function designators, it is the value generated 
hy the subroutine for the function (i.e., from its declaration—see below) 
when applied to the current value of the independent variables involved. 

We come finally to the “arithmetic” expression itself, which includes the 


simple arithmetic expression as well as the “if clause’’ alternatives: 


(arithmetic expression) :; = (simple arithmetic expression) 
(if clause)(simple arithmetic expression) 
else (arithmetic expression) 
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As an example of the evaluation of such an arithmetic expression, consider 
if Q>0 then S+3xQ else 3xS+Q 


If S = 4and Q = 41, that is, if the relation is true, then the value of 
the arithmetic expression is 


S+3xQ = 4+3 =7 


On the other hand, if Q = —1, that is, if the relation is false, then the 
value of the arithmetic expression is 


3xS+Q = 12—1 = 11 

Since the last arithmetic expression in the definition can also involve 
an if clause, and so on, we could have the following as an arithmeti¢ 
expression: 


if Q>0 then S+3xQ else if S>5 then 3x5+Q 
else if Q+S<0 then S else 


From our definition it follows that, to evaluate such an expression, t 
‘Gf clauses” are evaluated one by one in sequence from left to right, unt 
one having the value 7 is found: the value of the arithmetic expression 
then the value of the first arithmetic expression following this “if clause, 

ONE FINAL REMARK: In the definition of “term” given above, we reall 
should have had “arithmetic expression” in the parentheses, rather th 
the more restrictive “simple arithmetic expression.” 

Designational Expressions. Just as instructions in machine languag 
are denoted by their addresses, so in an automatic-programming langual 
statements are denoted (or named) by means of labels. We shall del 
further into this in the next section when we discuss statements. Ho 
ever, the concept of a label is important to us here because a designatt 
expression is a rule for obtaining a label from (usually) several possi 
alternative labels. Before we describe designational expressions in de 
it will be helpful if we first mention briefly how they are used in the go 
statement. The go,to statement is directly analogous to a jump instrucl 
and appears as 


go to (designational expression) 


whence the designational expression selects the label of the statem 
to which the jump is to be made. Designational expressions can also 
used in conjunction with the switch declaration, but we shall defer @ 
sideration of this use until we discuss declarations, 
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The syntactical definition of a designational expression is 





(label) :: = (variable)|(unsigned integer) 
simple Sani : 
designational } :: = (label) | (( csiguational\, 
\ Soren / \ expression / 
; : imple 
designational ou ee 
N ‘expression ) i= (designational 
expression 
simple F : 
(if clause) i diel) else { designational\, 
* coaresion / \ expression / 


lor example, we might have 


if a>b then PDQ else if c>d then QRS else 136 


llere if a>b is true, the label is PDQ; otherwise c>d must be evaluated. 


‘Then if c>d is true, the label is QRS; otherwise the label is 136. As 
nother example, consider 


if a>b then (if c>d then QRS else 136) else XYZ 


Ilore if a>b is true, then we must evaluate c>d; if c>d is true, the label 
in QRS, otherwise 186. If a>b is false, then the label is XYZ. 


6-4. Statements 


Compound Statements and Blocks. Up to now we have been consider- 
lng the kinds of structures of which statements are composed. Now we 
shall consider the statements themselves (cf. Fig. 6-3), which are the 
Operational or functional structures of the language. Statements in an 
(ulomatic-programming language are analogous to the instructions of 
inuchine language. The performance or execution of each statement is 
(he action that carries out the desired computations; a sequence of state- 
ments is generally executed consecutively unless otherwise specifically 
indicated. Five types of statements will be considered: (1) assignment 
slalements, which perform calculations as indicated by arithmetic or 
Hoolean expressions; (2) go to statements, which can break the normal 
wlntement-execution sequence according to alternatives set forth by 
designational expressions and hence afford a means for jumping to other 
parts of a program; (3) conditional statements, which can place conditions 
ov the omissions or inclusions of other statements in the statement- 
wxccution sequence; (4) for statements, which automatically carry out 
loop or recursive processes; and (5) procedure statements, which serve to 
invoke (call for) the execution of a procedure declaration, or subroutine. 


The first four of these will be considered in this section; discussion of the 
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fifth will be deferred until the declarations of the next section have been 
described. 

Before initiating the detailed discussion of these statement types, 
however, we must first consider the ideas of labels, compound statements, 
and blocks. Let us recall from our initial example that statements are 
set apart by semicolons. Thus the statement ;$; means “Execute the 
computations called for by S.” It is often desirable to give a statement 
a label, or name, L, analogous to the address of an instruction. From 
the syntactical definition in the previous section, we see that the label L 
may be a simple or subscripted variable or an unsigned integer. If L is 
to be the name of statement S, we write ;L:S;. Thus ; L:S; tells the 
computer that the label L has been assigned to the statement S (and als 
that the statement S is to be executed). Then some other statement ma 
refer to the labeled statement simply by means of its label L. 

Frequently it is necessary to consider a compound statement, i.e., 
statement that itself consists of statements. Such compound sta 
ments are set between the words begin and end; e.g., begin S;S;S;Se 
isa compound statement of four statements. A compound statement ca 
also be labeled: for example, ;L: beginS;S;S;Send;. The statemen 
in a compound statement can themselves be labeled, as in 


LC: begin L1:S; L2:S; L3:S; L4:S end 


Here the compound statement label is LC, and the statement labels a 
L1, L2, L3, and L4. 

The highest form in the hierarchy is the block, which is a collection 
statements and compound statements, but which includes also the ass 
ated declarations. In a sense, a block is an entire code or program, jul 
as a recipe is an entire description about how to prepare a particul 
type of food. A block can be labeled. Tor example, the form of a blo 
(with D’s representing declarations) might be 


LB: begin D; D; D; L1:S; L2:S; LC: begin S; S; S end; L3:S end 


where the block itself must be bracketed by the words begin and e 
and where LB is the block label, LC is a compound statement label, & 
L1, L2, and L3 are statement labels. Of course blocks can contain bloe 
for example, 


LB1: begin D; D; LB2: begin D; D; S;S;S end; S; S; S end 


where LB1 and LB2 are the labels of the two blocks involved. Since t 
block is essentially a complete entity within itself, every block au 
matically introduces a new level of nomenclature. In particular, / 
of statements, of compound statements, and of blocks are considered 
be local to the block within which they occur. By this is meant that 



























i0,, to assign the value of the expression to the variable. 
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labeled structure inside a block cannot be referred to by its label by a 
statement outside the block; and, conversely, a labeled structure outside 
a block cannot be referred to by its label by a statement inside the block. 
This is the same as saying that one cannot jump to a statement in the 
middle of a subroutine; rather the subroutine must be entered from the 
beginning. (See Grammar of Blocks, page 228.) 

On the other hand, a variable or a procedure identifier can be declared 
(see type declaration below) to be local to (only) the block in which it 
occurs. Such a local variable or procedure identifier then has only the 
meaning intended in the block; the same variable or procedure identi- 
lier used outside the block will not refer to any value or procedure that 
occurs within this block. Variables or procedure identifiers not declared 
local to the block in which they occur will be understood to be nonlocal 
to the block, and hence to represent the same entity in the block level 
immediately outside the block as they do inside the block. Since a state- 
ment in a block may again itself be a block, the concepts of local and 
nonlocal to a block must be understood recursively. Thus a variable or 
procedure identifier which is nonlocal to block A may or may not be 
nonlocal to the block B which contains A as one of its components. 

We turn now to a description of what each type of statement does, 
which is analogous to describing the meaning of operations in consider- 
ing instructions. The descriptions of the five types of statements will be 
(ast in a more formal organization than we have been using up to now. 
In the sequel for each type of statement we will first present its syn- 
(nctical definition, then present a semantic description, and finally give a 
short grammatical discussion. 


Assignment Statements. SYNTAX 
(loft part) ::= (variable) := 
(assignment statement) :: = (left part) ( arbre) 
\expression/ 
Boolean \ 
left: part) { : 
etary) \expression/ 


MHMANTICS. The assignment statement is probably the most impor- 
(unt statement of ALGOL, although it is almost the simplest. As defined 
hy this syntax, the assignment statement has the form 


; VAR :=EXPR; 


where VAR represents a variable (simple or subscripted) and EXPR can 
lw an arithmetic or Boolean expression. It is a statement telling the 


fomputer to compute the value of the right-hand side and store that 


value (e.g., a number) in the storage location designated by the left part, 
The coloned 
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equal sign := is used to indicate that the left part is to be the name 
(or address) of the value of the right-hand side. If the expression is in 
terms of numbers, such as 1.772—4x3x(—0.31), then the indicated 


operations are performed. Thus 
sY:1.772—4x3x(—0.31); 


tells the computer to assign 2.89+43.72, or 6.61, as the value of Y. A 
statement such as 
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3 A:=3; 

of course requires no computation and merely assigns the value of 3toA. 
GRAMMAR. If the Boolean or arithmetic expression of an assignment, 
statement contains variables and procedure designators (i.e., functions), 
rather than simply numbers, then the computer obtains the values of 
the variables, which must have been previously assigned, and evaluat 
the functions, by means of appropriate subroutines. Thus to compu 
(—B+sqrt(Y))/(2xA), where Y represents BT2—4xAXC and A is 3 
B is 1.7, and C is —0.31, we would write the following sequence of stat 
ments, which would be executed from left to right: 
5 A:=3;B:=1.7; C:=—0.31; Y:=BT2—4xAxC; 
root :=(—B-+sqrt(Y))/(2xA 


In executing a sequence of assignment statements, the value of any va 
able is taken as the most recent value it attained during the executio 
of the sequence. If the new value of a variable is a function of t 
previous value of the variable, the variable’s symbol appears on bo 
sides of the statement; the most recent value is used on the right-ha 
side in computing the new value, which is then assigned to the left-ham 
symbol. Thus, if we desired to add 2 to J, where initially J = 1, 
would write 
;J:=1; J:=J+2; 

where the second statement really means J (new value) = J (previ 
value) +2. Such increasing or updating of the value of a variable is 
important method for handling indices during a loop or recursive routi 

go to Statements. SYNTAX 
(go to statement) :: = go to (designational expression) 

semantics. A go to statement interrupts the normal sequence 
execution of successive statements by explicitly defining the label of 1 
successor to be the value of the designational expression of the sta 
ment. For example, if the label of the next desired statement, whieh 
not in normal sequence, is L, then we would write simply 


390 toL; 
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where L is a “designational expression’’ consisting only of a single label. 
As a more complicated example, consider the following: 


; go to if BT2>4xAxC then POS 
else if BT2<4xAxC then NEG else ZRO; 


Ilere the alternative possible next statements to be executed are labeled 
POS, NEG, and ZRO, and which one is actually to be executed next 
depends on the current values of A, B, and C. 

GRAMMAR. A go to statement cannot lead from outside a block into a 
block. This is because labels are inherently local, i.e., apply only to the 
block within which they occur. This becomes easy to see on recalling 
from above that essentially a block, like a subroutine, is operationally 
complete within itself. 


Conditional Statements. SYNTAX 
unconditional\ _,_ /assignment\|/ goto \ /procedure\ 
\ statement / *' statement / | \statement/ | \statement/ 
/ for \ | /compound\ 
\statement/ | \statement / | (block) 


/unconditional\ 


a clause) { statement / 


(if statement) :: = | (label) :(if statement) 


conditional\ : ; : 

\stalemene (if statement)|(if statement) else (if statement) 
\(if statement) else (statement) 

_ /conditional\ | /unconditional\ 


(statement) 2: = 
r ent) \ statement / \ statement / 


semantics. The if statement is the basic ingredient of a conditional 
statement and is of the form 


; if BEXP then 8; 


where BEXP is a Boolean expression (see page 211) of value T (true) or 
F (false) and S is an unconditional statement. For such a statement the 
computer would first determine whether BEXP is true or false: if BEXP is 
true, then S would be executed; otherwise S will be skipped, and the 
statement following the if statement will be executed next. Generally 
vonditional statements will involve compounded if statements, and it is 
important to note that in the compound the if statements themselves 
muy be labeled. For example, we could have a conditional statement 
of the form 


; LCS:L1: if B1 then S1 else L2: if B2 then S2; $3; 


where LCS is the label of the entire conditional statement, L1 and L2 are 
the labels of the first and second if statements, B1 and B2 are Boolean 
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expressions, $1 and S2 are the unconditional statements, and $3 is the 
statement following the conditional statement. A second form of the 
conditional statement is allowed, which differs at the end: for example, 


; LCS:11; if B1 then S11 else L2: if B2 then S2 else S3; S4; 


where again LCS, L1, and L2 are, respectively, the labels of the entire 
conditional statement and each if statement, B1 and B2 are the Boolean 


expressions, $1 and $2 are unconditional statements, and $3 and S4 are © 


statements that are not necessarily unconditional. These two forms are 
illustrated in Fig. 6-4. 





Fic. 6-4. (a) Flow diagram of the first conditional form; (b) flow diagram of the 
second conditional form. 


The oxecution of a conditional statement progresses from left to right, 
with each successive Boolean expression of the if statements being evalu. 
ated; only after the first true Boolean expression is found is the uncone 
ditional statement associated with that if statement executed. (When 
executing a conditional statement, care must be taken to distinguish 
between the if as used in the statement and the if that may appear I 
a component expression of the statement!) If this unconditional states 
ment is a go to statement, then it will define the label of the next states 
ment to be executed. If it is not a go to statement, or if none of the 
Boolean expressions is true, then the statement following the entire cone» 
ditional statement is executed next (see Fig. 6-4). The following diagram 
illustrates this situation for the second of the above forms, involving ne 
go to statements: 


B1 true > pB2 true 


; if B1 then S1 else if B2 then S2 else S3; $4; 


fab ct Roe iad LJ] 
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GRAMMAR. Frequently the go to statement is used in conjunction with 
the conditional statement. As an example of such use, consider the 
problem of finding the largest of three given numbers U, V, and W. 
The sequence of statements, according to Fig. 2-8 (page 70), will be 


; if U>V then go to UW else if V>W then go to VLN; 
WLN :LNC:=W;; go to T; VLN:LNC:=V; go toT; 
UW: if U>W then LNC :=U else go to WLN; T: stop; 


Here the largest number is to be assigned as the value of LNC. Four 
statements were labeled to be referenced by the go to statements, namely, 
WLN, VLN, UW, and T. Two conditional statements were used. This 
is somewhat reminiscent of a one-address code. 

for Statement. SYNTAX 


(tally condition) :: = (arithmetic expression) 
/arithmetic\ /arithmetic\ yore /arithmetic\ 
\expression/ \expression / \expression / 
/arithmetic\ . / Boolean \ 
‘ while ; 
\expression/ \expression/ 
(for list) :: = (tally condition)|(for list),(tally condition) 
(for statement) ::= for (variable) :=(for list) do (statement) 


step 





SEMANTICS. The automatic compilation of loops, or recursive codes, 
is accomplished by the for statement. The for statement causes the 
statement following the do (which may be a compound statement) to be 
executed repeatedly, where the value of some variable is changed or 
updated for each repetition. The for statement automatically senses 
the talky and successively evaluates the conditions stated by the “for 
list” of the for statement until every one of the tally conditions of the 
‘for list” is met. There are three kinds of tally conditions: (1) those 
(hat simply specify a particular list of successive conditions which are 
to be exhaustively met; (2) those that count the number of iterations 
hy stepping up some parameter until the value of the parameter is greater 
(han some tally constant; and (3) those that successively utilize the 
result of each iteration to evaluate a relation, with iteration continuing 
only while the given relational test holds. The three types of “for ele- 
ments’ of the syntax are designed to meet these three types of tallies, 
is will now be illustrated. 

The form of the for statement is 


; for VAR:=TC1,TC2,... ,TCN doS; 


where TC1, TC2, . . . , TCN are the “tally conditions’ which comprise 
the “for list” and § is a statement, The for statement successively 
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Ilere we initially set Y at 0 and then successively add Z and multiply by 
coefficients ¢, b, anda. These coefficients could themselves be arithmetic 
expressions, of course. 

The form of the for statement using the second type of tally condition is 


; for VAR:= ... ,A step Buntil C, .. .doS; 

























Go on to 
next tally 
condition, 

i.e.,i i+ 1 


Iterate for 
ith tally 
condition 


where A, B, and C are arithmetic expressions and S is a statement. 
Ilere the initial value of the variable VAR is set equal to the value of A; 
the statement S is then executed. Next a new value of A is computed 
us the sum of the present value of A and the value of B; if this new value 
is less than or equal to the value of C, S is executed again, using the 
new value of A. When the new value of A becomes greater than that of 
C, then the statement following the for statement is executed next. We 
can write this explanation in terms of ALGOL as follows (see Fig. 6-5c): 


; VAR:=A; L: if VAR>C then go to EXIT; S; 
VAR :=VAR-+B; go to L; EXIT; 





Go on to 
next statement 


(a) 








Where EXIT is the next tally condition, or the next statement if the “for 
list’? is exhausted. So far we have considered (the value of) B > 0; if 
B < 0, then we define the operation of the for statement as follows: 


: VAR:=A; L: if VAR<C then go to EXIT; S; 
VAR :=VAR-+B; go to L; EXIT; 


As an example of the step-until type of for statement, consider again 
the problem of finding the largest of three given (positive) numbers N[1], 









(c) N [2], and N[3]: 
Fra. 6-5. (a) Loop illustrating successive evaluation of tally conditions; (b) the first ; LNC :=0; for J:=1 step 1 until 3 do 
type of tally condition of the form for VAR:=...,A,... doSis simple substl if LNC<NIJ] then LNC:=NIJ]; stop; 
tution of A; (c) the second type of tally condition of the form ’ 
for VAR:= ...,AstepBuntilC,...doS Ilere the “for list”? has a single tally condition; we initially set the con- 
involves a loop with successive updating of VAR by B on each iteration; (d) the th tents of LNC at 0; then we compare successively each of the three num- 
type of tally condition of the form for VAR:= .. . , Awhile B, . . . dos invol hers with the contents of LNC, assigning the number to LNC if it is 


a loop where the evaluation of S itself updates A on each iteration. 


larger than LNC; finally, the value of LNC will be the largest of the 
(hree numbers. The “arithmetic” expressions A, B, and C turned out 
for this example to be just the numbers 1, 1, and 3, respectively: such 
simple arithmetic expressions will frequently occur. Note that the use 
of the if statement interprets the “next statement’ to be the if state- 
ment again with the index J changed; this is a common use of the for-if 
s(atement combination. 

The index VAR of a for statement need not appear explicitly in the 


evaluates S for VAR, taking on the successive values as directed by t 
“for list” (see Fig. 6-5a). The first type of tally condition is of the fo 


s for VAR:= .. + sA,-- .doS; 








(see Fig. 6-5b), where A is an arithmetic expression that is simply subs 
tuted for VAR when § is executed. Several of these can be used in 
for list, as in the evaluation of Y = a(Z+b(Z-+cZ)): 


: ¥:=20; for VAR:=c,b,a do Y¥ :=VARx(Z+Y); statement S. For example, consider the computation of » M;, where 


imi 
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Mix = 3Mj2 — 9Mj_,, Mo = 0, and M_; = 1. The code is 


; SUM:=0; Ma:=1; Mb :=0; for J:=1 step 1 until 5 do 
begin Mc:=3xMbt2—9xMat2; SUM :=SUM+Mc; 
Ma:=Mb; Mb:=Mc end; stop; 


Each time around the loop, J is increased by 1; then the process ends 
after five iterations, as desired, when J becomes 5. Note that a, b, andc¢ 
really correspond to j — 1, j, andj + 1, respectively. 

The form of the for statement using the third type of tally condition is 


;for VAR:= ... ,AwhileB,... dos; 


where A is an arithmetic expression, B is a Boolean expression, and S is a 
statement. Here the initial value of the VAR must have been set before 
the occurrence of this tally condition, by a previous tally condition or a 
statement before the for statement. The new, updated value of VAR is 
computed by A for each iteration. The statement S often computes the 
value of some variable that is involved in the Boolean expression B; at 
other times, B may involve the variable VAR. Then the sequence of 
computations carried out by the for statement can be summarized (see 


Fig. 6-5d) as 
; L:VAR:=A; if —1B then go to EXIT; §; go to L; EXIT; 


Here the new value of the variable is determined first ; then B is evalu- 
ated: if B is true, then —1B is false and the statement S is executed}; 
otherwise B is false, —1B is true, and the exit is to be taken next. 
As an example, consider the computation of 
3 5 ‘7 
sin@=0-Ste-At 
g2nti 


while (Qn+1! >e 


The code is as follows: 


; TERM :=0; SUM :=6; 
for J:=1,J+1 while TERM>e do 
begin TERM :=(—1xTERM x012)/((2xJ+1)x2xJ); 
SUM :=SUM+TERM end; VSIN :=SUM;; stop; 


Here we multiply the previous term by —6?/((2J + 1)2/) to obtain the 
present term; the partial sum is then formed—that is, § is a compound 
statement. This process is repeated as long as TERM>e, Note tha 
we initially do the computation for J = 1, and then we go into the while 
type of tally condition with J = J+1, Such initialization by a tall 
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condition of the first type is typical of the use of the while type of tally 
condition. 
All three types of tally conditions can be used in the same for state- 


ment. For example, 
9 
ie 
n=1 
would be formed by 
; SUM :=0; for n=1,2,3,4 step 1 until 6.n+1 while nX9 
do SUM :=SUM-+n72; 


GRAMMAR. Loops can contain loops; in other words, for state- 
ments can contain for statements. For example, consider the code for 


5 3 
d[Y e+ nG+9)] 
namely, 


H SUM :=0; for i:=1 step 1 until 5 do 
begin for j:=1 step 1 until 3do SUM:=SUM + (i+1)x(j-+2) end; stop; 


A general rule to apply is that no jump may be made into the range of a 
for statement (i.e., into the “do statement” S, which may be a compound 
statement ) except during the use of subroutines not affecting the for 
indices. The following diagram indicates valid jumps (i.e., if followed 
hy go to statements, or just go to statements): 


jfor ... do begin 


(i aera ae a 
utine?\subroutine 


subro 


If the exit was made through a go to statement (as part of the compound 
alatement S), the value of the index, or controlled variable (i.c., VAR in 
the above forms), will be its final value reached just before the ‘exit: If 
the exit was due to the completion of the index or controlled-variable 
range, then the value of the controlled variable is undefined after the exit. 


6-5. Type, Array, and Switch Declarations 


Need for Declarations. In our chocolate fudge illustration of Sec. 6-1 
we observed that, before the actual procedure of the recipe was piven. 
A lint of ingredients and certain special notes were needed. In our die 
(ussions in Sees, 6-2 through 6-4, we have defined the basic expressions 
wid statements of our language, in terms of which we can write the actual 
eode, which are analogous to the procedure of the recipe. We turn now 
to the structures in terms of which we write the declarations of our lan- 
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guage, which are analogous to the list of ingredients and the notes of our 
recipe. In this section we shall consider those declarations that describe 
the nature of the variables involved (analogous to the ingredients list); 
and in the next section we shall describe the procedure declaration, or 


subroutine (analogous to the notes of our recipe). 


Type and Array Declarations. In order that the computer will know 
what kinds of operations may be performed on the variables involved in 
a code, and also that it will have a record of the names of these variables, 
the variables are declared as to the type of value they represent. The 
types of variables are (1) real, meaning such variables may assume posi- 
tive, zero, Or negative values; (2) integer, meaning such variables may 
tegral values, including zero; and 
(3) Boolean, meaning such variables may assume only the truth values 
T and F, for true and false respectively. The syntax of a type declaration 


assume only positive and negative in 


would therefore be 


(type) ::= real\integer|Boolean 
(variable list) :: = (variable)| (variable list), (variable) 
{type declaration) ::= (type)(variable list) 


For example, for our illustration of finding the quadratic root, we woul 


declare 
real A,B,C 


whereas in the double summation illustration above we would declare 


integer i,j 


In addition to those specifying the type, there are specification deel 
rations that describe variables (usually subscripted) which represe 
arrays, such as vectors or matrices. The array declaration tells the ¢o 


puter how many elements there are in the vector or matrix, so that t 
e reserved. The form of an arm 


appropriate memory space may b 
declaration is 

array VAR[Li :Ui,Lj :Uj] 
where for each index of the variable VAR the lower and upper bounds 
given. Thus for the matrix 


au G12 
G21 22 
a31 432 


we would write array MATRIX(1:3,1:2]. For a vector V with five 
ponents (Vi, V2, Vs, Va, Vs), we would write array V(1 :5], since the 
only a single index involved. The notation extends to any num 
indices, and if more than one array is being declared at the same t 
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the list is separated by commas; e.g., 
array MATRIX[L:3,1 :2], V[1 :5] 


ee generality, the upper and lower bounds may themselves be 
arithmetic expressions. Also, in order that the “type” of the array ele- 
me nts be known, the type is usually specified before the word 

lhus we might have meee 


integer array V[r—s:r-++s] 


me there are 2s + 1 elements in the vector, each of which is an integer 

3 witch Declarations. The switch declaration acts as a variable covmecior 
y specifying the labels of alternative statements to which a go fo stat 

ment may jump. The syntax of a switch declaration is as Pie vr 


es designational\ 


: /designational\ 
expression / 


(switch list) : : (switch list), \ acy 
expression 


switch \ |. ; F 

\declaraltnay. 20 switch (variable) := (switch list) 

| he apes of the switch declaration is as follows: The switch list 

i , ne oe an ordered list of labels, since it will be recalled that a desig 
tional expression is it: i its 

a s itself a label or has a label as its value. Thus its 


switch NAME :=LA,LB,LC 


Cee ag is Tee of the switch itself and LA,LB,LC is the list 
abels. e switch, or variable co i 
iment of the syntactical form porte ng ag 


go to (switch designator) 
where for “switch designator” we have 
(awitch designator) :: = (variable)[(integer)] 
Thus the switch designator is of the form 
NAME[N] 


Yee NAME is the name of the switch corresponding to the switch 
declaration and N is an integer telling which label of the switch decla- 
rition to use. Thus if LA, the first label, is to be used, then N would be 
wt to 1, and we would write go to NAME[1]; if the second label, LB, is 
Wo be used, then N would be set to 2; and so forth. For exam 1 

sider the problem of forming . al 


A=X?+Y ifN=1 
A=X+¥Y* ifN=2 
A= 0 ifN= 3 
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We would form the required N, and then by means of the switch we 
would jump to the proper assignment statement : 


; switch SW :=P,Q,R; J:=N; go to SWIJ]; 
P:A:=XT2+Y; go to T; Q:A:=X+YT12; go to T 
R:A:=0; T:stop; 


We could have used a more general description of the “switch designator” 
by utilizing an integral-valued arithmetic expression in the square brackets 
rather than only an integer. Similarly, we could generalize the switch list 
to itself include switch lists by extending the syntax of the designational 
expression to include switch lists (see Exercises 6-43 and 6-44). 

Grammar of Blocks: Local and Nonlocal Variables. Recall from above 
that a block is essentially a compound statement that includes some 
declarations at its beginning. A declaration for a variable specifies the 
nature of that variable for one block only; i.e., the declared variable is 
local to the block in which it is declared. Outside that block, the vari 
able can be used for other purposes but can never refer to its use within 
the block in question. Hence a variable used inside a block but not 
declared in it, ie., not included in a type or array declaration in t 
block—such a variable is nonlocal with respect to the block and will rep 
sent the same entity immediately outside the block as well as within if 
Of course every variable must be declared in some block. Then a varl 
able represents the same entity for all blocks which are subcomponen 
of the block in which it was declared. Labels of statements are alwa 
local to the particular block in which they occur. Sometimes the ter 
scope is used for the range of statements for which a variable has t 
same meaning. Then the scope of a variable is the set of statemen 
within the block for which the specifying declaration for the variable W 
made—and this includes all “subblocks’”’ within this block. When 
statement is entered, the value of any variable is always assumed to 
its most recent value, in the “Jocal’’ sense. 


6-6. Procedure Statements and Declarations 


Need. The statements discussed above—the assignment, go to, co 
ditional, and for statements—enable the evaluation of arithmetic @ 
Boolean expressions and also present a certain amount of ‘decisi@ 
making” and “jumping” capability. These statements are sufficient 
handle any algebraic computation. Clearly, however, there will be 
large variety of other entities which, though they cannot be predicted 
advance, will be suggested by the nature of specific problems as t 
occur. Hence room must be made in our language for the incorporm 
of special computational entities as desired—or, in more familiar wo 
there must be a method in our language for the use of subroutines, 
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course, these computational entities will themselves be expressed in 
terms of the four basic statements. This requirement results in the 
need for procedure statements and procedure declarations. A procedure 


st atement tells the computer to execute a procedure declaration, which 
latter is the subroutine itself. 


Procedure Statements. SYNTAX 





(actual parameter) ::= (variable)|(expression)|(function designator) 
actual- actual- . 
L erenetee / ::= (actual parameter) | ( parameter }, (actual parameter) 
list \ list / 
actual- / actual- \ 
\ Patamioter / ::= (empty) | (\ parameter ) 
part list / 
procedure\ . ‘ ; Bovual 
Nctaiaeseaee (procedure identifier) (parameter 
part 
SEMANTICS. 


A procedure statement tells the computer to execute a 
procedure declaration or subroutine which appears in the declaration of a 
program. The ‘procedure identifier’? constitutes the name of the sub- 
routine (Just as it constituted the name of the function as described 
above), and the “actual-parameter list’? in the parentheses is P list. of 


hoth the dependent and independent variables involved. The form of the 
procedure statement is 


; PROCID(AP1,AP2,AP3); 


W here PROCID is the procedure identifier and, say, AP1 is the dependent 
viriable and AP2 and AP3 are the independent variables. Frequently 


(here may be more than one dependent variable computed by a procedure 
slatement. For example, 


; TRIG(0,2,EXPO,SIN,COS); 


might be the procedure that tells the computer to execute a subroutine 
(enoted by TRIG that has as its independent variables an angle @ and 
iii wecuracy e and computes as dependent variables EXPO, whose value 
ine’; SIN, whose value is sin 6; and also COS, whose value is cos 6. Of 
vourse, to find out just what the meaning of the dependent and tide: 
pendent variables actually is, one must refer to a description of the 
vorresponding procedure declaration or subroutine. 

GRAMMAR, ‘The procedure statement does not necessarily use the same 


" " y g 
net ual parameters” in the actual-parameter part as the procedure decla- 
fation itself does, For example, 


; TRIG(oxte,U,V,W); 
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4 . oc l 
i in) the TRIG subroutine: the order of the “‘actua 
would also invoke (call in) the ee 


Thus in our example Xt 
SIN, and W to COS. Sup- 


parameters” identifies the actual correspon 
independent variables of the subroutine. 
corresponds to 6, e to ¢, U to EXPO, V to 
pose we desired to use this subroutine to compute 


e{A sin (wt) + B cos (wt)] 


The procedure statement in the actual code would b 


e 


[Cuap. 6 


_ | TRIG (oxt,,U,V,W); Z:=(AXV+BxW)/U; 


Here U, V, and W would be com 


i te 
i dthen their values would be used to compu 
ala t, and e were computed by previous state 


assumed that the values of o, 
ments in the program, represented by . « -). 

Procedure Declarations. Finally, we come 
itself. Its syntax is as follows: 





(formal parameter) ::= (simple variable) 
formal- formal- 
( parameter } ::= (formal parameter) 
. list / list 
formal- \ / formal- 
/ parameter } 2: = (empty) | (phremietet ~ 
\ / list 
part 
(variable list) ::= (variable)|(variable list), (variable) 


Ghee, :: = (empty)|(variable list} ; 
par 


(type) ::= real\integer|Boolean 


(specification\ = (empty)|(specifier)(variable list); 


\ part = / 


/ function ) 
(specifier) :: = (type) |switch|(type)array | (type) procedure \ designator | 


(specification, (pecifier)(variable list); 


\ part 


rocedure\ |. _ t 
aeaiing / **~ \identifier / ante’ F/>\ part /\ part 
(prose) ::= (statement) 

ody 


. procedure 
(procedure declaration) :: = procedure heading 


) 


/procedure 


\ 


body 


/procedure\, / formal- _ /value\ /specification 


) 


to the procedure declaratio 


A DaFAmelse / (formal paramete 


) 


puted first by means of the TRIG sub- 
Z (it is of course 
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The semantics of a procedure declaration are as follows: The word 
procedure indicates that what follows is a subroutine; the ‘procedure 
heading” is the name of the subroutine and is followed by the list of 
“formal parameters” (the independent and dependent variables) in 
parentheses, together with certain specifications, concerning the vari- 
ubles, which enter into the subroutine. Note that the ‘‘procedure body,” 
i.c., the subroutine itself, is defined as a single statement, and hence must 
be written as a single compound statement, in the usual begin . . . end 
notation for such statements. Thus the form of a procedure declaration 
will be 


/value\ /specification \ 2 . 
\ipart / \ pare / begin... end; 


— 

procedure-declaration heading procedure body, 
i.e., main part 
of subroutine 


procedure NAME(V1,V2, ... ,VN); 


All “formal parameters,” i.e., the independent and dependent, variables 
listed in the formal-parameter part (for example, V1, V2,... , VN of 
our form), must be declared as to type in the “specification part.”’ If one 
of the formal parameters is an array, it must be declared as an array, 
ind also as to “type” of elements of which the array is composed. Any 
switch used in a procedure body may be declared in the “specification 
part,” but as was seen above, switch declarations may also be used in 
the body itself. Note that the procedure may involve some function: 
in the formal-parameter part, the function is indicated only by its ‘‘pro- 
wedure identifier,” or variable name; in the specification part, however, 
the type and the function designator are listed. We shall get to the 
ieaning of the “value part’’ in a moment. 

The effect of a procedure statement is to replace the statement by the 
procedure declaration and to execute the “‘statement” that comprises the 
subroutine of the declaration. Recall from above that the formal param- 
olers of the procedure declaration are associated with the actual param- 
wlors of the procedure statement in a one-to-one ordered correspondence, 
(he first formal parameter corresponding to the first actual parameter, 
(he second formal parameter corresponding to the second actual param- 
wler, and so forth, from left to right. When the procedure declaration 
toplaces the procedure statement, those formal parameters that are inde- 
pendent variables of the procedure must take on the most recent values of 
the corresponding actual parameters. No problem results when each 
ictual parameter is a single variable, for the corresponding formal param- 
elors will then be replaced by the most recent values of the actual param- 
tlors in the body of the procedure. But consider the case where the 
ielual parameter is itself an expression involving some arithmetic or logi- 
tal operations, Then there are two possible courses of action; the first is 




































































232 AUTOMATIC-PROGRAMMING LANGUAGES [CHAP. 6 


a 


to have the expression evaluated, and then have this value replace the 
corresponding formal parameter in the procedure body; the second is to 
replace the formal parameter in the procedure body by the corresponding 
actual parameter expression enclosed in parentheses, leaving its evalu- 
ation to be accomplished during the execution of the procedure. The 
first course of action is best when the formal parameter occurs frequently 
in the procedure body, for then the corresponding actual parameter 
expression need only be evaluated once. Those formal parameters for 
which this first course of action is desired are indicated by listing them in 
the “value part” of the procedure heading. 

Examples of Procedure Declarations. As a first example, consider the 
procedure declaration corresponding to the procedure statement 


: TRIG(@xt,2,U,V,W); 


used above in the evaluation of e~*[A sin (wt) + B cos (wt)|. The pro- 
cedure computes 


En Le Le 
CS hat hat sit ay apr 


63 9° 

sin @ = 6-3) + 5 aa 
0° 64 

cos 6 = 1 — a) + 4) — i 


To evaluate these three functions in the same procedure, we could fir 
form 6"/m! and then add this (with the appropriate sign) to the respee 
tive partial sums as called for by the series. We use a switch declaratio 
for the alternation of sign for the sine and cosine. The procedure deel 
ration would be 


procedure TRIG(6,2,EXPO,SIN,COS); value 6,¢; 
real 0,2,EXPO,SIN,COS; switch SAC :=S,C; 
begin real SUME,SUMC,SUMS,TERM; integer I,J; 
TERM :=0; SUME :=1+6; SUMS :=6; SUMC :=1; J:=2; 1:=23} 
B:TERM :=TERM x6/J; 
if TERM>« then SUME :=SUME+TERM else go to R; go to SAC|I 
C:SUMC :=SUMC+(—1)T(J/2)x TERM; I:=1; go to T; 
S:SUMS :=SUMS-+(—1)T((J—1)/2) x TERM ; 1:=2; go to T; 
T:J:=J+1; go to B; 
R:EXPO :=SUME; SIN:=SUMS; COS :=SUMC end 


Note that the inputs to the subroutine are 6 and e, in that order; t 
outputs are EXPO, SIN, and COS, in that order. The switch is reset 
go through the alternate sin (or cos) loop at the end of each loop, Final 
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the partial sums SUME, SUMS, and SUMC are assigned to their corre- 
sponding variables. 

As another example of a procedure declaration, consider matrix multi- 
plication, where it is desired to compute 


K 


yee ey 


Ca =) AgBr id ee ee ee ce ee | 
- 
‘he procedure declaration becomes 


procedure MATMULT(Aii,j],B[j,k],Cli,k]); value I,J,K; 
integer I,J,K,i,j,k; real array A[1:I,1 :J],B[1 :J,1:K],C[1 :I,1 :K]; 
begin for i:=1 step 1 until I do 
begin for k:=1 step 1 until K do 
begin C[i,k] :=0; for j:=1 step 1 until J do 
Cli,k] :=C[i,k]+-Al[i,j] x B[j,k] end end end 


As a final example consider integration by means of the trapezoidal 
rule, namely, 


b 
A = i fla) dv = 4 (f(a) + 2f(a + h) + 2fa + 2h) + +++ + F0)] 


Here we shall utilize what is called a comment declaration, which is sim- 
ply a way of inserting into the procedure descriptions that are ignored 
hy the computer but are of some use to the programmer when reading 
(he procedure declaration. 


procedure TRAP(FCT,a,b,e,V,A); value a,b,e,V; real a,b,e,V,A 
real procedure FCT(x); comment a and b are the min and max, resp. 
of the points defining the interval of integration. FCT(x) is the furie= 
tion to be integrated, ¢ is the permissible difference between two suc- 
cessive trapezoidal sums, and V is greater than the max absolute value 
of FCT(x) on (a,b); 
begin real Ibar,h,J,S,1; integer n,i,k; 
hi=b—a; Ibar:=Vxh; J :=1/2x(FCT(a)+FCT(b)); I:=hxJ 
$:=0; n:=1;i:=0; for i:=i+1 while absa—Ibar)>¢e do 
begin h:=h/2; Ibar:=1; for k:=1 step 1 until n do 
S:=S+FCT(a+hx(2k—1)); 
l:=hx(J+S); n:=2tTi end; 
A:=I end 


Note that for the 7th iteration we add 2‘ more points to S, namely, the 
ww points between the previous points. Then 


hx((1/2)x (FCT(a)+-FCT(b))+S) 


hocomes the new approximation to the integral. Note that the standard 
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function abs(y), the absolute value of y, need not be declared. 
ise 6-50. 

TE aks ae Procedure Statements. The procedure statement ha 
then, the same effect as a function. The differences, however, are 
follows: First, the “function designator” can stand only for a singl 
valued function, whereas the procedure statement may eall for a mult 
valued result. Second, the independent-variable list, analogous to y 
“actual-parameter list,’”’ contains only independent variables, whereas, 
the procedure statement, both dependent and independent variables q 
included. For these reasons, the function designator can be include 
within an arithmetic expression, while clearly the procedure stateme 
could not be so included. In fact, it is for the same reasons that t 
procedure statement has to be a complete statement by itself. Of cour 
the same effect of a function designator can be obtained with a procedu 
statement. For example, suppose that sqrt(Q,Y) is the procedure sta 
ment that computed Q = /Y. Then our quadratic root would 
computed by 


- A:=3: B:x1.7; C:=—0.31; Y:=BT2—4xAxC; 
a sqrt(Q,Y); root :=(—B+Q)/(2x 


where sgrt(Q,Y) would call in the square-root subroutine. 

The use of a procedure declaration for a function 1s similar to the ul 
of an ordinary declaration in that the actual parameters of the functi 
designator correspond in order to the formal parameters of the functi 
procedure. For example, suppose QFT (u,v) is the function designator 


f(u,v) = au? + buv + cv? 

To see the use of the function in a code, consider the evaluation of 
Z = Af(r?,s?) + Bf(r?,s*) 

After declaring the function in a procedure declaration (see below), 


would write 


Z:=AXKQFT (rt2,8T2)+B x QFT(rT3,sT3) 


The procedure declaration for a function is the same as pe for 
ordinary procedure statement except that the type must precede the Ww 
procedure and that a dependent variable never appears. Instead, wit 
the procedure body a value must be assigned to the function a 
i.e., to the name of the function. For our QFT(u,v) we would ha 
for a = 3.8, b = 6.2, andc = 1.1, 


dure QFT(u,v); value u,v; real u,v; begin 
se pis PR :=6.2; ‘ s1.13 ‘OFT :caxutl2+bxuxv-+e xvT2 
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As another example of a function procedure consider the evaluation of 
sin @ by a series: 


real procedure sin(0,e); value 6,e; real 0,8; begin 
real TERM,SUM;; integer J; TERM :=0; SUM :=0; 
J:=0; for J:=J+1 while TERM>: do 
begin TERM :=(—1xTERM x612)/((2xJ+1)x2xJ); 
SUM :=SUM-+TERM end; sin:=SUM end 


‘To use this function procedure consider the computation of 





Z = 3sin? (wu — v, 2.) + V1 — sin? (u? — v?, e) 


fore = 1/n. After declaring sin(6,2) we would write 


; Z:=3x(sin(u—v,1/n))T2 + sqrt(1—(sin(ut2—vt2,1/n))T2); 


Since we have declared the value of @ and e, the computer would first 
‘ompute u—v and 1/n for the first sin evaluation and uf2—vT2 and 
1/n for the second sin evaluation before entering into the procedure itself 
vither time. 

Procedure declarations can utilize themselves (by means of the corre- 
sponding procedure statements) recursively. For example, consider 


real procedure FACTORIAL(n); value n; integer n; 
begin FACTORIAL := if n=0 then 1 else nx FACTORIAL(n—1) end 


6-7. Exercises 


6-1. List at least 10 more characters used in English in addition to the 77 mentioned 
in Sec. 6-1. (Hin: Look at a typewriter keyboard.) 

6-2. What difficulties are involved in defining a “well-formed,” intelligible word? 

6-8. Define a “well-formed” number. (Hint: See Sec. 9-11 on significant figures.) 

6-4. For written English, define phrase, clause, and paragraph. 

6-6. Define a “well-formed” algebraic equation. 

6-6. Identify the following as an unsigned integer, or a decimal fraction, or a decimal 
Number, or a number, or some combination of these: 


(a) 386. 


(b) . 28. 
(d) .028. 


(e) 0100.01. 


(c) —3.65. 
(f) 000.000. 

6-7, Discuss the semantics of ‘“numbers’’ (i.e., the ordinal and cardinal meanings), 

6-8. Discuss in detail the semantics of a verb. 

6-9. Use the meta-metalanguage symbolism ‘to define a “well 
lixercise 6-3), 

6-10, Extend the metalanguage, or symbolic-syntax, definition of a number given 
ii See, 6-1 to include the exponent part of the form y+aa..: a, 

6-11. Write a symbolic syntactical definition of a subscripted variable. 
Une the definition of a function designator of See, 6-2 as a guide.) 


-formed”’ number (see 


(Hint: 
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Calculate the truth values of the following three Boolean expressions (for real 


numerical values): 
6-12. jal + Jb] => Var? +b?>Apt+q>p-—g. 
6-18. 2? >@oPt+e>pP-P=Pp>qIVep<qVP=% 
3, b 


6-14. For a = = 6,c = —4, and d = 8 evaluate 
(a+b >at tbe = at — a < abel) Vb — sac >b — date Aa <a 


6-15. Evaluate the expression of Exercise 6-14 with a = 8, b = —4, c = 6, and 


d = 3. 
6-16. In the expression 


uxXw+vzy AuxXv<zV(aVb=" IcDd) 


if the parentheses were omitted, how would the order of evaluation be affected? 
6-17. Identify the line of the syntactical definitions which is utilized in each stage 

of the development of the simple Boolean expression given on page 210. 
6-18. Diagram the expression of Exercise 6-14 as illustrated on page 210. 


Evaluate the following three Boolean expressions (where a, ¢, f are false and 


d, g are true): 
6-19. if if if a then b else c then d else f then g else sin2+.05>tan2 


6-20. if a then —]b else if c then d else if f theng else sin2+.05>tan2 
6-21. if ifaVbAc=f then adb =cAd else T then F else T 


6-22. Express the Boolean expression 
if a then b else c 


in terms of the Boolean operations only. (H1nr: Write out the truth table for t 


expression. ) 
6-23. Express the Boolean expression 


if if a then b else cc then d else a 


in terms of the Boolean operations only. 


For a = 3, b = 4, c = 6, evaluate the following five arithmetic expressions: 

6-24. a—b/cxX6/(b—c)T2Xa/b 

6-25. b/c/bXat3xcta/b 

6-26. if a—b/cT2=b—c/atc then a—b/cX6/(b—c)T2Xa/b else 
b/c/bXat3xcTa/b 

6-27. if a>b thenc else if b>c thena else if c>a then b else c 

6-28. if b>c then a else if c>a thenb else if a>b thenc elsea 


6-29. Diagram the expression for root shown on page 205, as illustrated on page 21 
compare the diagram with that for root on page 213. 


Perform the indicated statements.and find the value of Zz: 

6-30. A:=3; B:=1.7; C:=—0.31; Y:= if BT2—4ACSO then sqrt(A+B) 
sqrt(B—C); Y:=Y+10; Z:=Y 

6-31. A:=3; B:=1.7; A: =A-+B; B:=BT2+2XA; Z:=A+B 

6-32. A:=3; L:A:=A+2; go to if 10XA—AT220 then L else M; 
M:Z:=At2—16 
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6-83. A:=U; B:=V; C:=W; go to if UZV A UW then ULN else 
if VZUA V2W then VLN else WLN; ULN:Z:=U; 
VLN:Z:=V; WLN:Z:=W 
6-34. if if VSUA VSW then Z:=V else if USVAUSW then Z:=U else 
Z:=W a 
6-35. Using a single conditional statement of the first form, write a code that 
computes min(a? — be, a — b/c). 
6-36. Using a single conditional statement of the second form, write a code that 
computes min {max (a,b,c),max (d,e,f)]. 
6-37. Discuss the difference between the uses of if in an expression and in a con- 
ditional statement. Give an example that uses both types of if in the same statement. 


Write codes in the ALGOL language to compute the following five functions: 
2. 
“ al(n — 2)! 


n 
! 
6-39. = nr Lanz 
. s tlm —aiPe * 


zx 


6-40. x 


ll 
twee 


Y +965. 
17=0 
6-41. =~ 2 __at (2)(4)__x* 
acon? =F a[1+5r eat wear t *') 


x 
1+ 


3+ 
5 + 


a 








6-42. arctan x = 
x 


40? 
Qa? 
162? 
7 + 9 eee 


6-48. Extend the syntactical definition of a designational expression to include 


switch lists. Discuss the advantages of such an arrangement. 


6-44. Write a syntactical definition of an array declaration. 
6-45. Certain parts of the procedure-statement and procedure-declaration syn- 


tnetical definitions utilize the form (empty). In each case where it appears, give a 
specific example of how this case may arise. 


Write procedures for determining the values of the six following: 
6-46. 2 = VN. 
6-47. The trace T of a matrix (a:;), where 

n 


T= ) ax 


i=1 
6-48. The transpose of a matrix, where 
ais’ = ai 


6-49, That element of a matrix which has the greatest absolute value. 
6-60, The integral A of f(x) to an accuracy of ¢ by Simpson’s rule (see Sec. 9-6). 
6-61. An element of a table by the method of Fig. 3-11. 


6-62. Write a procedure for ordering a list of numbers (see Sees, 11-5 and 11-6), 
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Consider the following procedure: 
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real procedure GPS(I,N,Z,V); real 1,N,Z,V; 
begin for 1:=1 step 1 until N do Z:=V; GPS:=1 end 


6-53. Using the GPS procedure, write a code for forming the scalar product of two 
vectors. 


6-54. Using the GPS procedure, write a code for forming the vector product of two 
vectors. 

6-55. The GPS procedure is called (modestly) the “general problem solver” by 
D. E. Knuth and J. N. Merner, in their “ALGOL 60 Confidential,” Communs. ACM, 
vol. 4, no. 6, June, 1961, because, in concept, by using the GPS any computable fune- 
tion can be computed. Discuss the use of GPS from the point of view of ALGOL as 


applied to the processes of matrix multiplication and finding the mth prime number as 
given in that article. 
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CHAPTER 7 


BUSINESS-ORIENTED DATA-PROCESSING LANGUAGES: 
COBOL 


7-1. Common Business Oriented Language COBOL 


Introduction. In the previous chapter we studied an algebraically 
oriented language, ALGOL, which was used to illustrate the concepts 
involved in the use of the basic operational procedures and subroutines 
of an integrated, self-consistent automatic-programming language, 
Attention was particularly focused (1) on the arithmetic, Boolean, 
and designational expressions as the fundamental ingredients of the 
procedures; (2) on the basic types of procedures, including the simple 
“assignment,” the “conditional,” the repetitive “for,” and so forth, as: 
presenting sufficient manipulatory capability to perform any general 
computation; and (3) on the method of utilizing subroutines (“procedure 
declarations’) to add further variety to the procedures, as the meth 
for developing a rich source of specialized statements for any particul 

lass of problems. 

; In ALGOL, very little attention is paid to the form or the nature 
the data, or to means for successively manipulating large collections 
records for systematic processing. The purpose of the present chapter i 
to study a language that is specifically oriented for such data-handlin} 
and -processing tasks. The “common business oriented language, 
called COBOL, will be used for this purpose, to illustrate methods (1 
for organizing large masses of data and (2) for systematically processi 
such data in successive iterative manipulations. In addition, the stu 
of COBOL will demonstrate (3) the extensive use of expressions Mt 
statements that more closely parallel the clauses, phrases, and stil 
ments of ordinary English usage. 

COBOL, like ALGOL, is intended to be a generally accepted language 
for which compilers are being written that will enable the running of 
single COBOL program on many different computers. In May, 19 
representatives from the government, computer manufacturers, 
industrial computer users met to form the COBOL committee, 
December, 1959, the first COBOL report was completed, and in A 
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1960, the U.S. Government Printing Office published “COBOL—A 
Report to the Conference on Data Systems Languages, Including Initial 
Specifications for a Common Business Oriented Language (COBOL) for 
Programming Electronic Digital Computers.” This report was revised 
in 1961, and a report ‘““COBOL-1961” was prepared. The present treat- 
ment is based on this “COBOL-1961” revised report. However, the 
treatment given in this chapter is of a much simplified version of COBOL- 
1961. The essential concepts involved in the COBOL language can 
casily be buried in a plethora of nonbasic details and refinements. These 
refinements become easy to grasp when the fundamental underlying con- 
cepts are thoroughly understood. Hence we have chosen to simplify 
COBOL wherever we felt that pedagogical enhancement would result; 
we have attempted whenever possible to strip the discussion to its essen- 
tials, omitting many refinements and complicating details. The reader 
should be adequately warned that we are not presenting COBOL-1961 
in this chapter, but rather we are restricting ourselves to the consider- 
ution of its essential concepts; to use COBOL-1961, the reader is referred 
\o the comprehensive treatment of the language given in the “COBOL- 
1961” report itself or to a computer manufacturer’s COBOL manual. 
Ilowever, on consideration of the report or the manuals, it quickly 
hecomes apparent that it is almost always best to approach them with a 
knowledge of the basic aspects of COBOL, as given in this chapter, as a 
prerequisite. 

Characters and Words. The metalanguage symbolism that we used for 
ALGOL (see Sec. 6-1) will also be used in the discussion of COBOL. 
Hence words enclosed within ( ) will be understood as generic terms 
referencing the COBOL language items; the | is again the metalanguage 
‘or’; and ::= will once more be used for “is defined as being.” An 
udditional pair of symbols will also be of some use, namely { }, where 
\he words enclosed are understood to be alternatives. For example in 
COBOL, the operator for add can be denoted by + or by PLUS; hence 


we would write 
+ 
PLUS 


Aw before, direct COBOL words and symbols will appear in boldface. 
The characters of COBOL aret 


(letter) +: = A|B|C|D|E|F|G|H|I|J|K|L|M|N|O|P|Q|R|S|T|U|V|W|X|¥|Z 
(ligit) :: = 0|1]2|3|4/5/6|7/8/9 
(logical value) :;= T\F 


| For our discussion of COBOL, we have chosen to reduce the number of characters 
from that actually used, 
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: - / 
arithmetic, . | + | x | 

( operator / *" errs heads] TIMES| | |DIVIDED BY 

(relational operator) :: = IS GREATER THAN|IS LESS THAN 
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| ie EQUAL | 
(logical operator) :: = AND|OR|NOT 
(delimiter) :: = (|)| +-|12|"” 
(editing symbols) ::= $|*|- 


A word of COBOL is analogous to a variable of ALGOL, except that 
two or more fragments may be combined with a hyphen, -; that is, 





letter) 
(fragment) ::= ace | (fragment) Ree 
(word) ::= (fragment)|(word)-(fragment) 


Note that in COBOL a word can start with a digit as well as a letter, 
The definition of a number for COBOL is exactly the same as that for 
number of ALGOL and will not be repeated. However, unlike ALGOL 
a word or a number in COBOL is Limited to 30 characters. 
In COBOL, several different kinds of words are distinguished, in 
semantic rather than syntactical sense. These are (1) data-names 
(2) literals, (3) condition-names, (4) procedure-names, and Ca 
The “data-name” is the name (or address) of a data-item. T at 
each of the data-items, which themselves never explicitly appear im 
COBOL code but reside on the data tapes, must be denoted by a dat 
name. In COBOL we say that the value of a data-name is a data-ite 
just as in machine language we say that the contents of an address 18 
word, The COBOL statements generally deal only with the data-nam@ 
whose values are operated upon—in the same manner as machine-langua 
instructions deal only with the addresses whose contents are opera 
upon. The data-name must be spelled with at least one alphabetic ch 
acter (letter). A data-name may be hyphenated, such as 


QUANTITY-IN-STOCK or STOCK-NUMBER 


There are occasions when it is desirable to include a “data-item 
explicitly in the COBOL code itself; such data-items are called liter 
In other words, a literal is a data-name whose value is identical to 
characters comprising the name. For example, in the arithmetic exp 
sion 3XY+6, the numbers 3 and 6 are literals, but Y is the name of 80 
other number in the data. Numbers appearing in a COBOL code 
always be interpreted as literals. However, nonnumerical literals mu 
placed within quotation marks so that they can be distinguished from 
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names, e.g., 


“CHAIR” or “FLIGHT-128” 


It is often desired to give names to the literals themselves, and these 
are called condition-names to distinguish them from data-names. Thus 
the value of a condition-name is a literal, which appears explicitly in a 
COBOL code; the value of a data-name is a data-item on the data tapes. 
As an example of the use of data-names, suppose for a certain problem 
we desired A to stand for NEW-YORK, B to stand for TRENTON, 


C to stand for WILMINGTON, and D to stand for BALTIMORE. 
Then we would assign 


NEW-YORK as the condition-name for the literal “A” 
TRENTON as the condition-name for the literal “B” 
WILMINGTON as the condition-name for the literal “C” 
BALTIMORE as the condition-name for the literal ‘“D” 


We shall describe below (see Special-purpose Data Descriptions in Sec. 
7-4) how condition-names are distinguished from data-names in a very 
special manner. 

Unlike ALGOL, in COBOL the individual statements themselves 
cannot be labeled. Instead, only a “block” or “paragraph” of state- 
ments can be labeled, and such labels are called procedure-names. The 
procedure-name may be composed solely of digits, but most often the 
procedure-name spells out some mnemonic for the procedure being 
labeled. 

A verb is a word, associated with a statement, that usually identifies 
the type of statement, such as ADD, GO TO, etc. 
tix a fixed part of the syntax of the statement. 

Summarizing, the data-name is analogous to the variable of ALGOL, 
(he procedure-name to the label of ALGOL; the concept of the literal, 
which was implicit in ALGOL for numerical values, is extended to include 
uny word as well, and the names of literals are called condition-names. 

The variables or data-names may be subscripted, but a maximum of 
only three subscripts are allowed, and the subscripts are limited to only 


integers or data-names and condition-names whose ‘values are integers, 
‘is in, for example, 


The verb appears 


RATE (REGION, STATE, CITY) 
RATE (3, 5, 6) 


If both these forms represent the same element, then the value of 
REGION must be 3, the value of STATE is 6, and that of CITY is 6. 


The use of subscripted variables is the only way of referencing an ele- 
ment of a table, and subseripted variables are interpreted only as refer- 





244 AUTOMATIC-PROGRAMMING LANGUAGES [Cuap. 7 


encing elements of a table (see Special-purpose Data Descriptions in 
Sec. 7-4 for further details). 

Over-all Organization. The problems for which COBOL is designed 
are those that involve the processing of large masses of data. Hence to 
perform a COBOL computation the computer requires two kinds of 
information: (1) the program, in COBOL language, that is to be exe- 
cuted in order to perform the processing ; and (2) the data which is being 
processed (see Fig. 7-1). The actual data is usually repetitive in nature, 
e.g., records for each of, say, 1,000 checks, payroll-record information 
for each of 300 employees, sales-slip records for each of 500 sales, ete. 
























Law Racer DA 
(1) Data \) ae 
bei ~) 
nrbeaseed V storage area 





Procedure 
division 
Data 
division 


Fig. 7-1. Data and program organization. 


(2) Program 
in COBOL 
language 


Depending on the problem, the successive records of the data are recorde 
in predetermined formats. For example, a sales-slip format could be 
illustrated in Fig. 7-2a. The data itself is the information, recorded 
the proper format, for example as in Fig. 7-2b. Of course, when recordé 
on magnetic tape, the information is strung out as in Fig. 7-2c. In or 
that the computer should be able to distinguish the different data-ite 
the computer must be “told” the format, with the different data-ite 
given data-names (see Fig. 7-2d). Hence the COBOL-language progr 
must include (a) a so-called “data division,’’ where the organization 
the data records is specified, as well as (6) a “procedure division,” whe 
the actual processing operations are specified (see Fig. 7-1). Thus t 
data division effectively specifies symbolic addresses to the data-ite 
In this chapter we shall consider both these major divisions of t 
COBOL language. 

Large-scale data processing involves the successive processing of d 
records, one at a time, in sequence. For example, in the case of t 
sales slips, for each sale a tally might be kept of how many items of 
same type were sold; a total or gross income could be kept; a custo 
bill might be made; inventory levels could be adjusted; and so fo 
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When the processing has been completed on one sales-slip record, the 
next record would be read into the computer from the magnetic ta e 
and so forth. Note that, since the data description given in the ase 
division of the program assumes that the data is in a particular pre- 


pecifi d f T ) the d Us f Y J 
6 e ae ata t él, must be initial recor ded on the ma netic 











aT JOHN JONES 1/8/62 
ena 1201 ELM 
ITEM NUMBER ee3 
a JOE BLOW 
(a) (b) 
JOHN JONES 010862 1201ELM 283 $5.98 JOE BLOW. 
(c) 


Data-items on magnetic tape 


»» DOE|JJOHN JONES 010862 1201 ELM 283 $5.98 JOE BLOW] MARY LEE 8 6 61 


- ——” 
eee | CUSTOMER ADDRESS ITEM AMOUNT SALESMAN | ave 


Data-names of code 


— 
DATE 


Via, 7-2. (a) Illustration of a sales-sli 

: -slip format; (b) data recorded in th les-sli 
format of (a); (c) data strung out for recording on magnetic tape; (d) dinateatis ae 
relation between data-names and data-items. “ee 


COBOL differs from ALGOL in two fundamental ways: The first is 
that COBOL is specially set up to perform the same type of processin 
repeatedly for successive sets of data. Hence the data-handling as oe 
of COBOL become of prime importance. The second concerns the fact 
that the record data itself is distinct from the COBOL-language pro- 
gram or code that is to process it, and is not written as, or cider 
to be, part of the code. Only data in the form of literals oie be part of the 
code itself. Separating the data from the code could have the serious 
disadvantage that the data could not in any way control the processin 
heing done by the code. To avoid such a disadvantage, the concept of 
the condition-variable is introduced. A éotidition-variable isa aaa 
whose value is limited to certain specified values, called option-values 
lor example, let FACTORY be a condition-variable, and suppose that 
only four factories, denoted by A, B, C, and D, are invuived in the rob: 
lom, Then if each record on the tape is associated with a particular fac- 
bory, each will contain a particular option-value (A, B, C, or D) for the 
wondition-variable FACTORY, — If the code is to perform a different bet 
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essing for each factory, then (the “eontents’” of) FACTORY would have 
to be compared with the literals “A”, “B”, “C”, and “D”, which must be 
written in the code itself, and jumps would be made to different sections 
of the code in accordance with the outcomes of the comparisons. Thus 
the data on the magnetic tape can influence the processing performed by 
the code. Often the condition-names of the literals are used instead of 
the literals themselves. For instance, (the “contents” of) FACTORY 
(i.e., the data-item on the magnetic tape) would be compared with (the 
“contents” of) NEW-YORK—i.c., a literal given in the COBOL-language 
code—and so forth. In this way a bridge is made between the data on 
the magnetic tape and a code in COBOL language, by means of the 
condition-variable and condition-name (or literal) constructs. 

Procedure and Data Divisions. A code written in the COBOL language 
is composed of two parts, the procedure division and the data division. 
The procedure division contains the operational statements, whose exe- 
cution performs the processing. The data division contains the specifi- 
cations for and assignments of the data-names, the condition-names, the 
literals, and other structures to be discussed below. The procedure 
division involves statements that are similar to those of ALGOL; it will 
be considered in Sec. 7-2. The data-division description embodies the 
new concepts contributed by this chapter and is considered in Secs. 7-3, 
7-4, and 7-5. In See. 7-6 the data-handling methods unique to COBOL 
(as compared with ALGOL) are considered, and this involves both the 
procedure and data divisions; at the end of that section a complete 
COBOL code is illustrated. 

In the procedure division, the statements in COBOL (which in ALGOL 
were demarcated by semicolons) are placed together to form sentences, 
analogous to the compound statements of ALGOL. Hence a sentence 8 
composed of one or more statements separated by semicolons and ends with a 
period. A group of successive sentences that may be related in function 
form a paragraph, which is written with a physical separation. Only @ 
paragraph can have a label, called a procedure-name. This label is placed 
as a heading to the paragraph (see page 289 for an example). 

The data division is concerned with the record, file, environment, an 
identification descriptions. The records are the basic data-items being 
processed; collections of records are called files. By “environment” | 
meant the physical complement of input-output equipment available fo 
the particular problem, as well as the assignment of files to these equips 
ments. The identification description consists of a simple identificatio: 
of the code and file tapes. : { 

+ COBOL actually requires four divisions: the procedure division, the data division 
the environment division, and the identification division, We have chosen to lu 
these last three together, calling them collectively the data division, 





















Ssc. 7-2] DATA-PROCESSING LANGUAGES: COBOL 247 


7-2. Operational Statements 


Boolean and Arithmetic Expressions. The form and nature of the 
Boolean and arithmetic expressions of COBOL are analogous to the 
simple forms of the corresponding expressions of ALGOL, but words are 
sometimes used for relational and arithmetic operations. The relations 
of COBOL, however, are not limited to comparisons of magnitudes of 
numbers but include comparisons of words as literals as well. Here ‘IS 
GREATER THAN,” “IS LESS THAN,” and “IS EQUAL TO” for words 
are understood as if the words involved were numbers with the radix 26. 
lor example, “ABAA” IS GREATER THAN “AAAD” is true; and 
“DOG” IS EQUAL TO “DOG” is true [where we have put the dis- 
tinguishing quotation marks (‘, ”) around the literals]. For words of 
unequal length, the right-hand part of the shortest word is filled with 
“blanks” to make the words of equal length, with the blank considered 
smaller than any letter. Thus “ABAA” IS GREATER THAN “ABA” 
is true. In general, condition-variables and condition-names (that is 
words whose values are respectively data-items and literals) are éften 
used in the relations, rather than the literals themselves. For instance 
if it were desired to determine whether or not a report were from the Naw 
York factory, a comparison of the type (the ‘contents of?) FACTORY 
IS EQUAL TO (the “contents of”) NEW YORK would be made, where 
FACTORY is a conditional-variable and NEW YORK is a oondiviene 
name. For relations, then, the operations involved are 


(relational operation) ::= IS GREATER THAN|IS LESS THAN 


ie EQUAL | 
Ilence a relation can be given the following syntactical definition: 


(relational operand) :: = (literal)|(data-name)|(arithmetic expression) 
_ /relational\ /relational\ /relational\ 


(relation) ::= 
) \ operand / \operation/ \ operand / 


Of course both operands cannot be literals, or the test would be trivial; 
rather, if one is a literal, the other must be a data-name of a literal which 
is looked up (evaluated) for the comparison. 

‘The Boolean expression, as in ALGOL, is a rule for computing a logical 
value, true or false. Boolean expressions of relations involve logical oper- 
‘tions, namely, 


(logical operation) :: = OR|AND|NOT 


and can be formed like the simple Boolean expression of ALGOL—with 
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the same precedence order of operations, namely, 


First. Arithmetic expressions 
Second. Relational operations 
Third. NOT 

Fourth. AND 

Fifth. OR 


The COBOL language allows certain displacements and omissions of 
words, in order that it correspond more closely to customary English 
usage. The negation, or NOT, is usually placed after the IS in a relation; 


e.g., 
NOT (A IS LESS THAN B) 


becomes 
AIS NOT LESS THAN B 


The left operand can be omitted from a relation in instances where it 


would normally be understood, such as where 


AIS LESS THAN B OR AIS EQUAL TO C 


becomes 
AIS LESS THAN B OR EQUAL TO C 


Such omissions can be compounded, as in 
AIS EQUAL TO B AND C ORIS LESS THAN D 


The general rule used in COBOL in all cases is that the compiler takes 
the missing parts from the nearest preceding full relation which explicitly 


states all three parts in the same sentence, no matter what semicolons, — 


parentheses, connectives or other words, etc., may lie in between. This 
rule implies that the first relation of a sentence must have all the parts 
explicitly stated (why ?). 

Arithmetic expressions can likewise be expressed in terms of words for 
the operations, although here the symbols may be used if desired. The 
order of precedence of the operations is the same for COBOL as for 


ALGOL: 


a 





Order Symbol + Word equivalent 
Bil ceieess tad on EXPONENTIATED BY 
Second......... x, / | TIMES, DIVIDED BY 
Third..........| +, — | PLUS, MINUS 


es 
+ COBOL actually uses ** instead of T and * instead of X. The relation 
EQUAL TO can as well be symbolized by =, as we have seen, 
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The arithmetic expression is, as in ALGOL, a rule for computing a numeri- 
cal value; its syntactical definition for COBOL is identical to that of the 
simple arithmetic expression of ALGOL. 

Assignment Class of Statements. Corresponding to the assignment 
statement of ALGOL, there is a class of statements of COBOL. The 
first is called the COMPUTE statement and has the following syntax: 


FROM 
(left part compute) :: = COMPUTE (data-name) = 
EQUALS 
Crees ::= (left part compute) (arithmetic expression) 


Here the words FROM and EQUALS are equivalent to each other and to 
=. The effect of the statement is to evaluate the arithmetic expression 
and assign its value to the data-name of the left part. (When the arith- 
metic expression is itself just a data-name, the effect is to assign its value 
to the data-name of the left part.) For example, 


COMPUTE PRICE FROM COST TIMES (ITEMS MINUS DISCNT) 


would multiply the value of COST by the value of ITEMS — DISCNT 
and assign the resulting value to PRICE. 

The term COMPUTE is called a verb of COBOL, for it describes an 
action to be performed. Another verb in the assignment-statement class 
is ADD; the ADD statement has the following syntax: 


(addend) :: = (literal)|(data-name) 

(where of course the “literal” must be a number) 

(idend s:= (addend) (*422"9\, (addend) 
/addend\ 


(left part add) :: = ADD (addend), 


add \ 
\statement/ 


\ list / 


::= (left part add)|(left part add) | (data-name) 


TO 
GIVING 
‘Thus the ADD statement has three forms: (1) the first has just a list of 
(wo or more addends (why?); (2) the second has a “left part add” with 
the word TO followed by a data-name; and (3) the third has a “left part 
udd” with the word GIVING followed by a data-name. The compu- 
{ations to be performed for each case are as follows: (1) The sum of 
the values of all the addends is formed and assigned as the value of the 
rightmost addend (which therefore must be a data-name), (2) The sum 
of the left-part addend values is added to the data-name following TO 
and becomes the new value of that data-name, (3) The sum of the 
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he data-name following the 
left-part addend values becomes the value of t 

GIVING. For example, to add the PRICE and the SHIPPING costs 
and a sales tax of $3.85 forming the TOTAL, we would have the following 


for each of the forms: 


ADD PRICE, SHIPPING, 3.85, TOTAL. 
ADD PRICE, SHIPPING, 3.85 TO TOTAL. 
ADD PRICE, SHIPPING, 3.85 GIVING TOTAL. 


In the first two cases, the value of the TOTAL would be set to zero 


initially; this is not necessary for the third case. _ 
oe of the statements of the assignment class are similarly con- 
structed. Thus SUBTRACT is syntactically defined as 


/addend\ 
(left part sub) ::= SUBTRACT (addend)|SUBTRACT Mc ist 7 


/ subtract \ .. _ (left, part sub) FROM (data-name) 
\statement/ (left. part sub) FROM (addend) GIVING (data-name) 


The value of the addend, or of the sum of addends, of the “‘left part ~ 
is subtracted from the value of the data-name or addend following t 
FROM, and this difference becomes the new value of the date a 
For example, to form an INVOICE as the PRICE less the DISCO 


and a eredit of $3.85, we have 
SUBTRACT DISCOUNT, 3.85 FROM PRICE GIVING INVOICE. 


For MULTIPLY, we have 


/ multiply \ -:= MULTIPLY (addend) BY (data-name) 
\statement/ [MULTIPLY (addend)BY(addend)GIVING (data-name 


Similarly for DIVIDE we have 


/ divide \ .._ DIVIDE (addend) INTO (data-name) 
\statement/ "”  pryIDE (addend) INTO (addend) GIVING (data-na 


For example, suppose we wished to find the PRICE of 36 ITEMS liste 
at $150 per dozen, where we get a 10 per cent DISCOUNT: 


DIVIDE 12 INTO ITEMS GIVING DOZENS; 
MULTIPLY DOZENS BY 150 GIVING COST; 
MULTIPLY COST BY DISCOUNT GIVING AMOUNT; 
SUBTRACT AMOUNT FROM COST GIVING PRICE. 


GO TO and ALTER Statements. These COBOL statements are an 
gous to the ALGOL go to statement and switch declaration, Cons 
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first the COBOL GO TO statement, which has two forms: 


(label list) :: = (abel)|(label list), (label) 
/ goto \ .._ 
\statement/ ** GOTO Marey label\ 
|GO TO (label), ( Jin) DEPENDING ON (data-name) 


The first form is clear. In the second form, there is a choice of two or 
more destinations. The value of the data-name must be a positive inte- 
ger between 1 and N + 1, where N is the number of labels in the label list; 
then the first label corresponds to 1, and the labels of the label list in 
order from left to right correspond to the numbers 2 through N + 1. 
The effect of the GO TO statement will be to jump to that paragraph 
which corresponds in order number to the value of the data-name. 
The ALTER statement is as follows: 


(alter statement) :: = ALTER (label) TO PROCEED TO (label) 


The ALTER statement acts only on a GO TO statement with a single 
destination label. Its effect is to change the destination. The GO TO 
statement to be changed must form a paragraph by itself, so that it may 
have its own identification label. Then the label after the word ALTER 
is the identification label of such a GO TO statement; the destination 
lubel of the GO TO statement is replaced by the label that follows the 
words TO PROCEED TO in the ALTER statement. 

Conditional Statements. The conditional statement of COBOL is of 
u slightly different form from that of ALGOL. The syntax is 


(if statement) ::= IF (Boolean expression) THEN (statement) 
(if fragment) :: = <if statement)|(if fragment); (statement) 

(else statement) :: = ELSE (statement) 

(else fragment) :: = (else statement)|(else fragment) ; (statement) 
conditional\ |. 


ant yo (if fragment) (else fragment) pone 

The “empty” alternative is used only when another “else fragment” 
follows (i.e., when the conditional statement is the final “statement”’ of 
wn “if fragment’’—see example below) ; otherwise the period must be used 
(0 indicate the final statement of the “else fragment” (why?). Note that 
unlike ALGOL the ELSE must occur; that unlike ALGOL the “if state- 
ent’? may include a conditional statement; that unlike ALGOL the 
Is}oolean expression and other expressions occurring in the statements 
eannot include an IF (why?); and that unlike ALGOL the statements 
Within the conditional statements cannot be labeled (why?). Because of 
(hese first two differences, the interpretation of conditional statements 
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Fig. 7-3. Flow-chart interpretations of conditional statements (see text). 


which involve conditional statements is 


quite different from that 


ALGOL. For example, the following conditional statement is inte 


preted as in Fig. 7-3a (cf. Fig. 6-4): 


IF B1 THEN $1; IF B2 THEN S2 ELSE S3 ELSE S4. S6, 


if statement | 


conditional statement 2 else statement 1 


t ——————— | 
else fragment 1 


if fragment 1 


re ereTITGS? 


conditional statement 1 


Figure 7-3b is the interpretation of the following: 


IF Bi THEN S1; IF B2 THEN S2 ELSE 


S3 ELSE S4; 
IF B38 THEN S65 ELSE S6. 
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Figure 7-3c is the interpretation of the following: 
B2 true 
B2 false 





diye Raven ts eal 
IF B1 THEN IF B2 THEN S1 ELSE S2 ELSE IF B3 THEN S3 ELSE S4. $5 


B2 false 
B3 false 





B1 false 


That these results read something like English can be seen from this 
example: 


IF CUSTOMER-CREDIT IS EQUAL TO GOODRISK THEN 
IF CUSTOMER-YRS IS GREATER THAN 5 THEN 
COMPUTE DISCOUNT = OLDDISC 

ELSE COMPUTE DISCOUNT = NEWDISC 

ELSE GO TO BADCUSTOMER. 


Ilere CUSTOMER-CREDIT can have the value GOODRISK or BAD- 
RISK; CUSTOMER-YRS has the value of the number of years the 
person named has been a customer; the compute statements fix the value 
of the discount; BADCUSTOMER is the label of a paragraph that 
describes how to deal with bad customers. 

for Statement. The loop repetition, or for statement, of COBOL is 
actually denoted as PERFORM. The syntax is 


(left part perf) :: = PERFORM (label)|PERFORM (label) THRU (label) 
(from list) :: = FROM (addend) BY (addend) UNTIL (relation) 


ioe i ::= (empty)|(addend) TIMES|UNTIL (relation) 
oer |\VARYING (data-name) (from list) 


Geaaene has ) ::= VARYING (subscript-name) (from list) 
/ subscript tally\ AFTER Gases, (from list) 
condition / \ name 
/simple tally\ 
\ condition / 


bscript tall 
tpt pot (mea) 


( perform \ _. wl Rati park pati) 


atatement/ ** 


These syntactical rules give five options to the PERFORM statement, 
the first four associated with the four “simple tally condition’’ alterna- 
lives, the fifth concerned with the “subseript tally condition,” 
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Consider first the “empty” option of the simple tally condition: the 
PERFORM statement could then be of the form 


PERFORM LABELX THRU LABELY. 


which would simply tell the computer to jump to LABELX, to execute 
the statements of that paragraph and the next successive paragraphs 
until the statements of paragraph LABELY were executed, and then to 
jump back to the original program—no loop or iteration is called for here. 
Second, if the PERFORM statement is of the form 


PERFORM LABELX THRU LABELY N T IMES. 


then the sequence of paragraphs from LABELX through LABELY woul 
be repeatedly executed exactly N times—here we have a simple loo 
Third, the form 


PERFORM LABELX THRU LABELY UNTIL A IS LESS THAN B. 


tells the computer to execute the sequences of paragraphs repeated 
until the relation A IS LESS THAN B (in this case) is true. This 
directly analogous to the while alternative of the ALGOL for statemen 
Fourth, we have the anviogy to the step-until option of ALGOL, which 
COBOL is really a combined step-until and while statement, of the fo 


PERFORM LABELX THRU LABELY VARYING DANA FROM 
FIRSTN BY DELTA UNTIL A IS LESS THAN 


Here the variable DANA is explicitly displayed, as are the initial ¢o 
dition FIRSTN, the increment DELTA to be added to the variable af 
each iteration, and the exit condition (in this case, “A IS LESS TH 
B’). The fifth option involves the use of subscripted variables for 
formation of loops within loops. Since in COBOL subscripted varia 
are limited to only three subscripts, a maximum of three “from lists” 
be involved in the ‘subscript tally condition.” As an example, lig, 7 
shows the loops generated by the following statement: 


PERFORM LABELX THRU LABELY 
VARYING SUBS1 FROM FIRSTS1 BY DELTA1 UNTIL (relation 
AFTER SUBS2 FROM FIRSTS2 BY DELTA2 UNTIL (relation 2) — 
AFTER SUBS3 FROM FIRSTS3 BY DELTA3 UNTIL (relation 3), 


Procedure Statements and Declarations. Two statements of CO 
are used to incorporate subroutines into a program: the INCL 
statement and the DEFINE statement. The INCLUDE, to which 
ALGOL procedure statement is the most closely related, has this 
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tactical form: 

(replacement) ::= (data-name) BY (data-name) 

ee list) :: = (replacement)|(replacement list), (replacement) 

‘ include \ |. _ 

Vtuaner oe INCLUDE (label), REPLACING (replacement list) 












Set SUBS1 to FIRSTS1 
SUBS2 to FIRSTS2 
SUBS3 to FIRSTS3 


< relation 1> 


F T 


<relation 2> 






Exit 


< relation 3> 





















Execute 
LABELX through 
LABELY 


Set 
SUBS3 to 
FIRSTS3 


Form new SUBS2 
= SUBS2+DELTA2 


Set 
SUBS2 to 
FIRSTS2 


Form new SUBS1 
=SUBS1+DELTA1 














Form new SUBS3 
=SUBS3+DELTA3 






iu. 7-4. Use of subscripts to form loops by means of the PERFORM statement. 


The concept of a subroutine in COBOL is simply that of a labeled para- 
yraph. The INCLUDE statement calls for the subroutine by its label; 
then before the subroutine can be executed by the program, the vat 
ables or data-names of the subroutine corresponding to “independent”? 
ov “dependent” variables must be replaced by the corresponding data- 
Humes of the program. This correspondence is given in the “replace- 
mont list,” in which the left-hand data-name of a “replacement” is the 
ame of a variable in the subroutine (as originally written) that is to be 
placed by the right-hand data-name of the program, This is analogous 
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to the role of formal and aciual variables in ALGOL. As an example, 
consider the following subroutine, labeled TAKE-DISCOUNT: 


TAKE-DISCOUNT. MULTIPLY DISCOUNT-RATE BY COST 
GIVING DISCOUNT; SUBTRACT DISCOUNT FROM COST 
GIVING NET. 


The following program is an example of how TAKE-DISCOUNT might 
be used: 


MULTIPLY NUMBER-ORDERED BY ITEMCOST GIVING GROSS; | 
INCLUDE TAKE-DISCOUNT, REPLACING DISCOUNT RATE BY 
0.05, COST BY GROSS, NET BY INVOICE; ADD INVOICE, TOTAL. 


Here the program used the terms GROSS and INVOICE, which corre 
sponded to COST and NET of the subroutine. The most important 
aspect of the replacements is not just the substitution of terms, but i 
rather the fact that such a substitution of terms means that the valu 
utilized in the subroutine execution, as called for by an INCLUDE stat 
ment, will be those of the program, as desired. 

The DEFINE statement presents the opportunity for defining 
statements in a prescribed format. The INCLUDE statement affords 
means for using a subroutine, but the INCLUDE statement format mu 
be utilized. With the DEFINE statement, the format of a new stat 
ment may be defined, and then this new statement itself would be used i 
a program to jump to the subroutine. Hence the English connotatio 
of the subroutine use can be included in its “calling-up” statement. T 
DEFINE statement, which defines this format, is therefore really a t 
declaration rather than a statement; it declares the format of a new sta 
ment whose interpretation will be by means of the subroutine of t 
declaration. The syntactical definition of the DEFINE statement is 


(format) :: = FORMAT (verb-name) (verb structure) 


< ee » -: = DEFINE (verb-name) WITH (format) (verb subrouti 
where by “‘verb-name”’ we mean the identification given to the new ver 
analogous to the verbs PERFORM, ADD, SUBTRACT, etc.; by “ve 
structure” we mean the syntactical definition of the new verb; and 
“verb subroutine” we mean the subroutine that is called for when the ni 
verb is used in a program. The independent and dependent variabh 
of the verb subroutine must appear somewhere in the “verb structure, 
The verb structure is written out in full, using some generic identifieath 
for these variables, and the verb subroutine will use these generic iden 
cations also. When the new verb is used in a program, the gen 
identifications will be replaced by their actual values; when the jump 
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the new verb subroutine is made, the generic identifications are replaced 
by the corresponding actual values from the program. 


7-3. Over-all Data Organization 


Orientation. The operational statements of COBOL were in general 
analogous to those of ALGOL; however, it is in the organization and 
handling of data that the primary purpose of COBOL becomes apparent. 
The data-organization constructs are of central importance in the COBOL 
concepts. In the over-all view of this data organization, it is assumed 
that magnetic-tape units are used for storing the files and associated 
records being processed (see Fig. 7-5). In general, the data itself con- 
sists of lists of records, where there is one record of information for each 
employee, sale, invoice, inventory item, production item, individual medi- 
cal record, etc., under consideration. Each type of record used is given a 
fixed record format, such as that illustrated in Fig. 7-6. The record itself 
consists just of the information (as, for example, in Fig. 7-7), which 
becomes meaningful only when the record format is known. A file is a 
collection of similar records, e.g., the collection of records for all the 
employees of a certain plant, or for all the sales in a fiscal year, or for 
all the monthly invoices of a business, or for all inventory iteme of a 
particular warehouse, or for all items produced by a plant in a week 
or for all the medical records of a hospital for a year, etc. Each file 
will contain only records of a single type (i.e., format), and each file is 
stored on its own separate magnetic tape. t 

Some of the files are input files: that is, they supply the initial data 
to be updated or processed by the computer. The processed informa- 
tion is dumped onto output files (see Fig. 7-5). For example, in a payroll 
processing job, the input files will contain the records of the employees 
with the record for each employee containing all the information feéded 
to make up his paycheck. The output files would be the records of 
what information should be printed on each check and would also be 
ew employee records if some items such as “total earnings to date” 
must be updated. As a general rule, in updating a file a new file tape is 
made, rather than the old file tape changed. This is done in case of an 
error, so that the old and new records may both be available. At some 
luter time, the old file can be erased and the magnetic tape reused. 
Another example of this is the updating of inventory records, for which 


| Actually, each file may contain several different types of records; and it is pos- 
sible to have more than one file on the same magnetic tape, although it is More com- 
ion to have more than one tape associated with the same file. These generalizations 
are easily made at a later stage in the discussion, for they add no new concepts to our 
exposition, Hence for the present we shall consider the simplest case of a single type 
of record in a file and one magnetic-tape unit for each file under consideration. 
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new, revised inventory-record file is made, on a new output tape, from 
the old inventory tape and stock-record changes as input files. 

Flow of Information during Processing. The dynamics of a large-scale 
data-processing problem of this type are as follows: First, note that in 
xeneral the same kind of processing is to be performed repeatedly on 
cach record of the same type. For example in the payroll case, for each 
employee, say, on a regular salary, the deductions for social security, 
l'ederal income-tax withholding, state income-tax withholding, health 
insurance, etc., must be made, according to the employee’s total earnings 
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tion. 


\o date, number of exemptions, type of insurance policy, etc. Thus the 
sume computer code will successively process each employee’s record; the 
record must be updated with total earnings to date, with the paycheck 
(uta and deductions listed on separate output files. Hence the individual 
‘omputer records are processed one at atime. For such successive com- 
putations, an employee’s record is read into the high-speed memory of the 
‘omputer from the input-file tape; the processing is performed on this 
ecord and the several output records are generated; next these are read 
vl of the high-speed memory of the computer onto the proper output- 
lilo tapes; then the computer is ready to read in the next employee’s 
ecord and repeat the processing (see Fig. 7-8).t The computer or the 


| Of course it may be more efficient to read (write) more than one record at a time 
from (onto) the magnetic-tape files, and this is actually done. However, the pro- 
sowing clearly will in general involve one record at a time from a file, and hence 
Without loss of generality we may talk as if one record at a time is utilized from a file: 


fideed, the COBOL instructions are set up to maintain this point of view (see the 
RHAD and WRITE statements in Sec, 7-6), 
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COBOL code must therefore contain (1) a part that performs the oa 
essing on a record and (2) a part that reads in the next sae e 
instructions for the former part have already been discussed in t c “a 
ous section; the instructions for the latter data-handling operation wi bes 
described in a later section. The purpose of this and the next i 
is to describe how the computer is told about the format of the recor! , 
(the record description) and about the files or tapes at its disposal (the file” 
and environment description) —i.e., to desezibe the data-organization. 










Read records 
from input files 












Process records, 
i.e., generate records 
for output files 


Prepare for 
next record 








Write records 
onto output files 






Have all records 


Exit 
been processed? 


Fic. 7-8. Over-all view of the dynamics of successive record processing. 


Consider next the high-speed memory of the computer. As each 
the records is read in from an input file, one record at a time, it must 
located somewhere in the high-speed memory. Similarly, after an ow 
put record is formed, but before it is written onto its file, it must 
stored somewhere in the high-speed memory. In addition, ra } 
processing certain other areas of the high-speed memory must a 
cated for tables, constants, intermediate record results, etc. 
groups of areas (see Fig. 7-5) are called, respectively, the iit are 
the output-file areas, and the working-storage areas. The ona m 
be told where and how much space to allot to these areas (by the ree 

ing-storage descriptions). 
srr arene The description of a type of record format 
accomplished in the data division of the COBOL language. F ( YW cou 
only one record description for each different type of record is nee 
A record description tells the computer such things about the ore 
zation and nature of the record as the categories and subcategories, 
data-names for these categories and subcategories, the size of the ach 















(he higher the number, the deeper the structure. 
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data field (i.e., the number of bits or characters a data-item will require), 
the class or type (e.g., numerical, alphabetical, alphanumerical) of each 
item, and so forth. The computer requires this knowledge in order (1) to 
allot the necessary space in the input- and output-file areas for the files, 
(2) to locate data by its data-name, and (3) to perform the proper type 
of operations (e.g., arithmetical, logical, etc.). Note from Fig. 7-5 that 
the record descriptions are always kept in the COBOL area of the high- 
speed memory, whereas the data-records themselves are on the file tapes, 
being read into the high-speed memory from each tape, one at a time, 
only when their processing is taking place. 

The format for writing a record description (which is itself a format) 
is as follows:t 


Sena, ::= (level number) (data-name); (data description). 
(record description) ::= (data-item description)| 


(record description) 
(data-item description) 


The remainder of this section and the next section are devoted to dis- 
(ussions of the terms mentioned in this syntactical definition. However, 
it might first be of interest to observe the following example of a record 
deseription : 


1 MASTER; SIZE IS 22 CHARACTERS; CLASS IS ALPHANUMERIC. 
2 BDG-NBR; SIZE IS 7 CHARACTERS; PICTURE IS AAXX999. 
2 DATE; SIZE IS 6 CHARACTERS; CLASS IS NUMERIC. 
3 MONTH;; SIZE IS 2 CHARACTERS. 
3 DAY; SIZE IS 2 CHARACTERS. 
3 YEAR; SIZE IS 2 CHARACTERS. 
2 PLT-NBR; SIZE IS 3 CHARACTERS; CLASS IS NUMERIC. 
2 GROSS-PAY; SIZE IS 6 CHARACTERS; PICTURE IS 999.99. 


wach row of this example represents a ‘‘data-item description’’; the left- 
ost digit is the “level number,” the leftmost word is the “data-name,” 
und the rest is the “data description.” 


‘lhe level number is used to arrange the data in a hierarchical structure: 
Thus level number 1 


{ In this brief and introductory treatment we must necessarily omit some of the 


fooord-description entities; however, we hope to treat here the basic concepts involved, 
with further details to be obtained from the COBOL manual itself. 
pal notation, aligned vertical juxtaposition of generic entities will imply a columnar 
Hinting, 


In our syntacti- 


Thus 


(rooord description) :; @ (record description) 


(dataeitem description) 


Henna that a record deseription ia a listing of a column. of datacitom descriptions, 
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gives the name of the record itself, such as MASTER, TIME-CARD, 
BILL-OF-SALE, PROCUREMENT-ORDER, and so forth. The num- 
ber-2 levels give categories of data within the record; the number-3 levels 
give categories of data within the number-2 level under which they are 
listed; the number-4 levels give categories of data within the number-8 
level under which they are listed; and so forth. Thus the record 
MASTER (see above) has four subcategories, namely BDG-NBR, DATE, 
PLT-NBR, and GROSS-PAY. The DATE itself has three subcategories 
MONTH, DAY, and YEAR. As an example of an actual record as 1 
would appear on magnetic tape in the format described above, we couli 
have 


AUTOMATIC-PROGRAMMING LANGUAGES 


Elementary data-item Data-names 


BQY7123 BDG-NBR 
06 MONTH 
28 DAY DATE MASTER 
61 YEAR 
284 PLT-NBR 
236.65 GROSS-PAY 


where the left-hand column is the actual record data and the boldf ( 
words are the respective data-names. Of course, on the magnetic tw 
the data of a record just appears as a string of letters and numbers, 1. 


(previous record data) BQY7123062861284236.65(next record data) 


and the computer, by means of the COBOL compiler, would count { 
characters to locate the individual items of data. In general, these im 
vidual items of data are called the elementary data-items. 

Very often the data-names of elementary items cannot be used diree 
but must be qualified by the data-names of lower-numbered (i.e., hig 
levels. Such cases arise when an elementary data-item has the s 
data-name as another elementary data-item in either the same record 
another record that will be used in the high-speed memory at the 


time. For example, consider the following record descriptions: 
1 MASTER .... 1 NEW-MASTER .... 
2CURRENT-DATE .... 2 CURRENT-DATE.... 
3 MONTH .... —3 MONTH .... 
3DAY.... 3. DAXis bce > 
3YEAR.... 38YEAR.... 
2 LAST-TRANSACTION- 2 LAST-TRANSACTION- 
DATE.... DATE. A +s5ust 
3 MONTH .... 3 MONTH .... 
S8DAY.... S8DAY.... 
3YEAR...- S Va eit 
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To avoid ambiguity the MONTH contained in the CURRENT-DATE of 
the NEW-MASTER record (see arrow above) must be referred to as 


MONTH IN CURRENT-DATE IN NEW-MASTER 


This entire phrase would be used for the data-name of the elementary 
data-item with the arrow above, where each IN is followed by a lower- 
level-number data-name, successively as illustrated, until no ambiguity 
can exist. In any case where no ambiguity in data-name can exist, the 
data-name of the elementary data-item is used directly. 


7-4. Detailed Data Descriptions} 


Description of Physical and Operational Characteristics. We come now 
to a discussion of the ‘data description” part of the record description 
(see the syntactical definition above). So that the computer will know 
what to expect as it goes through the actual record data, as well as how to 
handle the different types of data, certain characteristics of the data can 
he described, with the descriptions separated by semicolons. The follow- 
ing can be specified concerning the physical and operational character- 
inties of an item of data:t 


SIZE PICTURE CLASS SIGNED USAGE 
‘The SIZE clause is specified as 
SIZE IS (integer) CHARACTERS 
where “integer” gives the actual number of characters of the data being 


described. If the data being described is distributed over deeper (higher- 
iumbered) sublevels, then the total number of characters in these levels 
js given. For example, if DATE is composed of MONTH, DAY, and 
YEAR, each of which has two characters, then for DATE the SIZE IS 
6 CHARACTERS. ‘This concept of data size is generalized to any num- 
hor of sublevel categories. 

The PICTURE clause can be given only for an elementary data-item 
und is simply a detailed format of its composition in terms of types of 
characters. In the PICTURE, A represents an alphabetical character, 
X an alphabetical or numerical (alphanumerical) character, 9 a numerical 
vharacter, S the + or — sign, . an actual decimal point, B a blank space, 
and $ an actual dollar sign.t Then, for example, 


PICTURE IS AAAXXX999 


| Since our primary aim is to treat concepts only, we shall omit from discussion many 
dotails and deseriptive parts that are not thought pedagogically important for the 
student at this point, Once the concepts are known, these details and omitted parts 
ny be easily obtained from the COBOL manual itself, 
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represents data consisting of (from left to right) three alphabetical char- 
acters, three alphanumerical characters, and three numerical characters. 
Similarly 

PICTURE IS 999.99 


represents a number composed of three numerical characters to the left, 
two to the right, of the decimal point. 

The CLASS clause indicates the type of data being described and can be 
ALPHABETIC, NUMERIC, or ALPHANUMERIC. It is written, for 
example, as 

CLASS IS NUMERIC 


If the PICTURE is given, then the CLASS is unnecessary. The CLAS 
description will extend over all higher-numbered associated levels. Th 
if for DATE we have CLASS IS NUMERIC, then we need not repeat t 
for the higher-numbered levels of MONTH, DAY, and YEAR. If, ont 
other hand, the MONTH were to be spelled out in the data, then t 
CLASS would have to be given at the MONTH level (why ?). ( 

If a numerical elementary data-item is to have a sign associated with i 
then the word SIGNED is included in its description. If the value 
some other elementary data-item is to be the sign of the item bei 
described, then SIGN IS {data-name) is written. A signed item must 
NUMERIC, and therefore its class need not be given; also, if a pictu 
that contains an S$ is given, then SIGNED is not necessary. 

The syntax of the USAGE clause is as follows: 


(usage type) 2: = COMPUTATIONAL-(integer)|DISPLAY -{integer) 
(usage clause) :: = USAGE IS (usage type) 


The purpose of the usage clause is to describe whether the data is to ta& 
part in numerical computations or is to behave as a word to be displa. 
or “read.” The “integer” part is to be a code to indicate the vari 
types of usages, within COMPUTATIONAL or DISPLAY, that 
require special operational handling by the computer. For example, f 
COMPUTATIONAL usage, 1 might represent a binary number, 2 a de 
mal number, 3 a floating-point number, and so forth. If the usage 
COMPUTATIONAL, then CLASS IS NUMERIC is understood and 1 
not be explicitly stated. 

Special-purpose Data Descriptions. There are some important spedl 
purpose data descriptions of COBOL which we shall now consi 
namely, 


VALUE OCCURS REDEFINES RENAMES 


The VALUE clause is used to describe the value of a condition 
Recall that a conditional-variable ean have several possible opt 


Sy 
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values—for instance, the conditional-variable FACTORY might have 
the option-values A, B, C, and D if the business in question had four 
different factories—and that it is frequently convenient to give names 
to each option-value, called ‘“condition-names.’”’ For instance, for the 
Spoons ee yg of each of the option-values A, B, C, and D might 
0 the name of the city in which each factory is locat - 
YORK, TRENTON, WILMINGTON, and heaton: os ea teve 
we may say that the VALUE of New York IS “A”, the VALUE of Trenton 
IS “B”, and so forth, where quotes (“, ”?) are used around the “value- 
literal.” Of course, the conditional-variable and its possible condition- 


names and their respective values must be stated in the record description. 
his is done through the following syntax: 


(condition-value entry) ::= 88 (condition-name), VALUE IS “(literal)”. 
(condition-value list) :: = (condition-value entry)| 
{condition-value list) 


(condition-value entry) 
condition \ |. 


\desoriptions (level number) {condition-variable) (data description) 
(condition-value list}) 


lor example, suppose that FACTORY is a level-2 item; then for the 
vondition description we would have 


2 FACTORY; SIZE IS 1 CHARACTER; CLASS IS ALPH 
ANUMERIC. 
88 NEW-YORK, VALUE IS “A”. " 


88 TRENTON, VALUE IS “B”. 
88 WILMINGTON, VALUE IS “C”. 
88 BALTIMORE, VALUE IS “D”’. 


Note that the SIZE is given as the size of the value of the condition 
variable, that is, the number of characters that a single value requires 
lhe “level number” 88 is reserved solely for the purposes of displaging 
ullernative condition-names and their associated values and has no true 
lovel-number meaning. 

The OCCURS clause is used for defining data tables or arrays. The 
(ata-names of the values in the table or array are subscripted variables 
wid the subscripts specify exactly which values of the table or array of 
the data-name are desired (like matrices). The OCCURS clause is 
written as 


OCCURS (integer) TIMES 


and can never be associated with a record-name, i.e., with a level-1 data- 
lem description. 


lor example, consider the description of a table of 


prices of shoes from three different manufacturers, each manufacturer 
Muking three different women’s shoe lasts and two different men’s shoe 
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lasts, where each of the women’s lasts comes in five different colors and 
eash: of the men’s lasts comes in two different colors. Figure 7-9 is a 
picture of the table. The table description is as follows: 


1 TABLE-SHOE-PRICES; SIZE 285 CHARACTERS. 
2 SHOE-MNFR; SIZE IS 95 oeceane aan 
IS NUMERIC; OCCURS 3 TI f 
Oe OMENS: SIZE Ts 25 CHARACTERS; OCCURS 3 TIMES. 
4 PRICE; PICTURE IS 99.99; OCCURS 5 TIMES. 
3 MENS; SIZE IS 10 CHARACTERS; OCCURS 2 TIMES. 
4 PRICE; PICTURE IS 99.99; OCCURS 2 TIMES. 


To obtain a value from a table in which the OCCURS clause has bee 
used, subseripted variables are used. The subscript 1 refers to the fir 
element of the OCCURS list, 2 refers to the second element of t 













Level 2 
SHOE-MNFR 
OCCURS 3 TIMES 









WOMENS 
OCCURS 3 TIMES 


=n ai 

OCCURS 2 TIMES ae 

womens | Et 
OCCURS 3 TIMES Yj 


WOMENS 
OCCURS 3 TIMES 


FOOTFORM 












WEAREASY 







WELLBUILT 


i i turer, kind, last, and color, 
Fia. 7-9. Table of shoe prices according to manufac A : 4 
squaves are the places for the prices: the shaded box, for example, conte ‘ 
for WEAREASY WOMENS shoes, their third last and fourth color; the so i 7 
contains the price for WELLBUILT MENS shoes, their first last and second color 


text). 


OCCURS list, etc. When more than one subscript is used, the ent 
data-name that is subseripted may be composed of successive level da 
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names, as described above. For example, 
PRICE IN WOMENS IN SHOE-MNFR (4, 3, 2) 


would represent the price of the shoes of the fourth color of the third type 
of women’s last of the second manufacturer (WEAREASY)—see the 
crosshatched square of Fig. 7-9. Note that the subscripts correspond to 
the occurrences in the successive component data-names. As another 
example, 


PRICE IN MENS IN SHOE-MNFR (2, 1, 3) 


would represent the price of the shoes of the second color, the first men’s 
last, and the third manufacturer—see the solid square in Fig. 7-9. A 
data-name can appear as a subscript, provided it has an integral value; 
then the data-name stands for its value in the subscript list. For 
example, 


PRICE IN WOMENS IN SHOE-MNFR (BLUE, 3, WEAREASY) 
would represent the crosshatched square in Fig. 7-9, provided that some- 
where in the data description there appear the items 


88 BLUE, VALUE IS 4. 
and 88 WEAREASY, VALUE IS 2. 


When no ambiguity will result, the data-name need not be composed. 
lor example, consider the following description of a matrix with six 
columns and three rows: 


1 MATRIX. 
2 COLUMN; OCCURS 6 TIMES. 
3 ROW-ELEMENT; OCCURS 3 TIMES. 


The actual matrix would be 
Qi1 Gig Gig Ai4 G15 A1e6 
G21 G22 23 G24 G25 26 
M31 G32 G33 G34 G35 G36 
'l'o refer to, say, the fourth element in the second row, we would write 
ROW-ELEMENT (2, 4) 


where 2 represents the second ROW-ELEMENT and 4 the fourth 
COLUMN, this latter being understood. 


It is frequently desirable to have alternative data-names as well as 


slructures for the same collection of elementary items in a record. In 
such a case, the REDEFINES clause is used as follows to indicate the 
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alternative desired: 
(level-number) (data-name) REDEFINES (data-name) ; (data description 


The alternative data-name is to the left, the original to the right, 0 
REDEFINES; the new description then follows the semicolon. Bot 
data-names, and both the original and new data descriptions, can the 
be used in the same program alternatively. For example, suppose tha‘ 
the original description were 


3 WOMENS-SHOE-COLOR; SIZE IS 25 CHARACTERS; 
USAGE IS COMPUTATIONAL. 
4 BLUE-PRICE; PICTURE IS 99.99. 
4 GREEN-PRICE; PICTURE IS 99.99. 
4 YELLOW-PRICE; PICTURE IS 99.99. 
4 PURPLE-PRICE; PICTURE IS 99.99. 
4 ORANGE-PRICE; PICTURE IS 99.99. 


The prices must then be referenced by naming the colors explicitly. If 
were also desired, as an alternative, to get the price through a subscript 
variable, then we would write 


3 WOMENS-SHOE REDEFINES WOMENS-SHOE-COLO 
SIZE IS 25 CHARACTERS. 
4 COLOR-PRICE; PICTURE IS 99.99; 
OCCURS 5 TIMES. 


Note that the data being described is the same in both cases, na 
a list of 5 prices of five characters each. Note that the range of 
data encompassed by a REDEFINES clause is determined by the le 
number of the original data-name; the new data-name must, of cou 
have this same level number. 

There is another interpretation of the REDEFINES clause: instead 
considering it as effecting a redefinition of the same data, it can be 
sidered as effecting the redefinition of the same storage area or part of 
high-speed memory. This latter interpretation becomes useful W 
there are involved in a program two different types of records that 
never utilized at the same time. If these records are of approximm 
the same length, then by means of the REDEFINES clause the 
area of the computer’s memory can be used to store either record, 
this way the computer’s memory can be more efficiently utilized, 
otherwise two separate areas of the memory would have to be rese 
for the records, even though both types of records would neve 
required in the memory at the same time. 

The RENAMES clause permits alternative or overlapping grou 
or classifications of the elementary data-items of a record, The 
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66 (data-name) RENAMES (data-name) 


where the level number 66 is used only to identify a RENAMES clause 
and the new or alternative data-name is to the left of RENAMES 
These clauses should directly follow the record description in which the 
renamed elementary data-items occur. For example, suppose there are 
five cost types included in a record and they are classified as costs for 
project 1 or project 2; suppose an alternative classification of these cost 
types into “direct” and ‘overhead”’ costs is desired, as indicated in the 
following table: 





Cost type 





A B C D E 








Project 1 cost....... 1 2 

Project 2 cost....... 1 2 3 
Direct cost.......... 1 2 3 
Overhead cost....... 1 2 




















I'he two classifications would appear as the following record descriptions: 


2PROJECT-1;.... 2DIRECT;.... 
3 COST-1;.... 3 COST-1;.... 
3 COST-2;.... 3 COST-2;.... 
2PROJECT-2;.... 3 COST-3;.... 
3COST-1;.... 2OVERHEAD;.... 
3 COST-2;.... 3 COST-1;.... 
3 COST-3;.... 3 COST-2;.... 


The left-hand record description is the original record, and it is desired 
Lo RENAME the elementary data-items so that the right-hand record 
(eseription will also apply to the data. 
(the following RENAME clauses will do the job: 


According to the above table, 


66 COST-1 IN DIRECT RENAMES COST-1 IN PROJECT-1. 
66 COST-2 IN DIRECT RENAMES COST-1 IN PROJECT-2. 
66 COST-3 IN DIRECT RENAMES COST-2 IN PROJECT-2. 
66 COST-1 IN OVERHEAD RENAMES COST-2 IN PROJECT-1. 
66 COST-2 IN OVERHEAD RENAMES COST-3 IN PROJECT-2. 


Of course the right-hand record never appears literally, but is implied 
hy the level-66 entries, 


The difference between the REDEFINES and 
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the RENAMES clauses is that the former redefines the same areas or 
locations in the computer memory regardless of the data stored there, 
whereas the latter renames the stored data itself, effectively (but not 
actually) introducing a different data-storage organization. 

The Working-storage Data Description. So far we have been consider- 
ing how to describe the record types that are contained in the files. These 
record descriptions tell the computer (1) the names of the elementary 
data-items as well as (2) just what area or part of the high-speed memory 
to reserve for each of the record types to be read into or out of the high- 
speed memory (from or onto the file tapes). In addition to these input- 
output file areas, there must be space reserved in the high-speed memory 
for temporaries to store intermediate results and for constants and initial 
conditions. Again, the computer must be told the names of these ele= 
mentary data-items to be stored, and how much space to reserve for them 
this is accomplished through the working-storage data description. 

The working-storage data description includes the working-stora; 
record descriptions, which are formed in exactly the same manner as 
the record descriptions discussed above and hence need no further di 
cussion. As in the case of all records, the data-names must be uniq 
when properly compounded, as described above. However, it is f 
quently desired that space be reserved for data that does not fit int 
part of a record, such as special temporary locations, initial values, and 
constants. Descriptions of such data, called noncontiguous workin 
storage data descriptions, are given separate from those of the workin 
storage records and are identified by the level number 77. They m 
contain the SIZE or PICTURE of the data and the CLASS and/ 
USAGE, in the manner discussed above. 
the other data-description clauses above. 

In the working-storage data description, the clause “VALUEIS . . , 
can be used to give the initial value of a data-name. For example, if 
sum is being formed, we might have 


77 SUM; SIZE IS 7 DIGITS; 
CLASS IS NUMERIC; VALUE IS 


Similarly, a VALUE IS clause can also be used for an initial value with 
arecord. For constants the same form can be used, except that, of cou 
the VALUE IS clause is required, in order to give the value of the eo 
stant: for example, 


77 INTEREST-RATE; SIZE IS 2 DIGITS; 
CLASS IS NUMERIC; VALUE IS 


If the constants bear relationships to one another, then the const 
can be written as a record description. Here each of the element 


They can also contain any 
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data-item descriptions must include a VALUE IS clause. However, since 
cach elementary data-item is supposed to have only one (constant) value, 
no level-88 entries can be written (why?). 

Tables of constants are handled in a very special way. The table is 
first described by means of a record description. The values of the 
entries are given by the VALUE IS clause; then this record can be 
REDEFINED, with the OCCURS clause in the redefinition. In this 
way subscripted variables can reference the table of constants (see 
Icxercises 7-37 through 7-40). Here level-88 entries can be written. 


7-5. File and Environment Descriptions 


The File Description. As we observed above, the data records are 
yenerally viewed as being stored on magnetic tape, sequentially. A 
collection of records is called a jfile, and we have assumed, for the sake of 
simplicity, that a file contains only a single type of record and that only 
one file appears on each tape. The data-record description, the topic of 
the previous section, was needed to describe the organization of the data 
in the high-speed memory, reserving the appropriate area in the memory 
for the records and working storage, as well as to “assign”? data-names 
to the various data “locations” in the high-speed memory. The file 
descriptions, on the other hand, are necessary (1) to detail the organi- 
vation of the records in the magnetic-tape files and identify the files, i.e., 
(o give the number of records in a file and the number of characters in a 
record, (2) to give the names of the record-types of the file, and (3) to 
give the formats of additional labels that may have been put on the 
beginning or end of the file tapes for checking and handling purposes. 
Whereas the record description was generic in character, describing the 
format of a record-type whose file may contain many thousands of such 
records, there is only one file associated with each file description. The 
form of the file description is 


er eee ee Re _ ; /label-record \ 
(file description) ::= FD (file-name); (file data) \denokiptionay 
(file data) :: = (file organization) (label-record types) 


(data-record names) 


Ilere FD denotes that what follows is a file description; the file-name is a 
word; the file data has three parts, which we shall now describe in detail. 


The File Data. The “‘file-organization” part of the file data contains 
three clauses: 


FILE CONTAINS 
The first is written as 


FILE CONTAINS ABOUT (integer) RECORDS; 


BLOCK CONTAINS RECORD CONTAINS 
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where the integer gives the approximate number of records in the particu- 
lar file in question, which may be useful in the handling of the file tape. 
Usually a group, or block, of computer (machine-language) words at a 
time is read into (out of) the high-speed memory from (onto) the mag- 
netic tapes. If the number of computer words is w, and if each record 
will require r such computer words, then the number of records per block 
is about w/r, taken to the nearest integral value. Thus, instead of one 
record at a time being read into (or written out of) the high-speed memory 
as we have previously assumed, in actuality several records at a time are 
transferred. These records are stored in the computer memory ‘on the 
side,” and only one record at a time (from each file) is put into the recon 
area of that particular file. Hence we can say that effectively one recor 
at a time from each file is handled by the computer. However, fo 
manipulation purposes we must tell the computer that each 


BLOCK CONTAINS (integer) RECORDS; 


Finally, it is helpful in handling the file tapes for the computer to 
told in the file description that each 


RECORD CONTAINS (integer) CHARACTERS; 


(even though the record description itself specifies this number). 
The data-record-name part of the file data is of the form 


(record-name list) :: = (record-name)|(record-name list), (record-name 


Is ; 
DATA RECORDS at (record-name list). 


where a “record-name” is the data-name of the first level of a d 
record, i.e., the true record name. We have been making the sim 
fying assumption that there is only one record-type in a file, in which ¢ 
only its name would appear in the “regord-name list’? and the IS opti 
would be used. If more than one record-type is in a file, then the na 
of each of the record-types would appear in the “reeord-name list” 
the ARE option would be used. 

Label Records. The “label-record descriptions”’ part of the file desert 
tion and the “label-record types” part of the file data concern the noth 
of label records themselves. A label record is a special record that 
placed on the file tape at the beginning, and often also at the end, of t 
file and/or tape (see Fig. 7-10). The constituents of a label record it 
depend on what computer is being used, as well as on the particu 
problem for which the file was prepared. The label records give # 
things as the date of composition of the file, the date at which the 
will be obsolete, the reel number of a file that requires several reels, 
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other checking and file- or tape-handling information that may be deemed 
necessary or helpful. The description of a label record is similar to, and 
can use the clauses of, the data-record discussed above. As seen eon 
l‘ig. 7-10, there are four types of label records that may be used on a tape: 


(label-record type) :: = BEGINNING-TAPE|BEGINNING-FILE 
[ENDING-FILE|ENDING-TAPE 

_ /label-record\ | /label-record-\ /label-record 

\ type /|\ typelist /’\ type 


Ilence the ‘“Jabel-record type”’ part of the file data includes the clause 


| [* stanDak - 


/label-record-\ _ 
\ type list / °° 


IS 


ARE STANDARD 


OMITTED 


‘The use of the STANDARD option implies that the label records adhere 
in type and in individual format to a standard form set up by computer 
manufacturer or other implementer. If this “standard” option is used 
then the label-record description need not be explicitly included, but i 
“understood.’”’ A standard label-record description follows: 


1 BEGINNING-TAPE; SIZE IS 25 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 IDENTIFICATION ; SIZE IS 9 CHARACTERS; 
CLASS IS ALPHANUMERIC. 

2 REEL-NUMBER;; SIZE IS 4 DIGITS; CLASS IS NUMERIC. 

2 DATE-WRITTEN;; SIZE IS 6 DIGITS; CLASS IS NUMERIC. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 

2 PURGE-DATE; SIZE IS 6 DIGITS; CLASS IS NUMERIC. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 


LABEL RECORDS 


When it is not desired to have label records at all, then the OMITTED 
option is used and the label-record descriptions are omitted. In all other 
tunes, the label-record-type list is given and the label-record descriptions 
are included. 


es “label-record types’ part of the file data includes another clause, 
of the form 


/data-name in\ 


VALUE OF \ label record / 


(data-name in constants) 
(data-name in data-record) HASHED 
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The purpose of this clause is to serve as a check or identification, and it 
can be used in either of two possible ways. In the first, the label record 
on the tape contains a certain constant or word, as the “contents” of the 
data-name following VALUE OF; this constant or word is then compared 
with a constant, whose data-name follows the IS, in the constants section 
of the working storage: if the comparison is successful, then the identifi- 
cation of the file has been confirmed; otherwise an error is indicated by 
the computer. In the second way, the label record on the tape again — 






































BEGINNING-FILE | DATA DATA DATA ; DATA ENDING-FILE 
LABEL RECORD | RECORD | RECORD RECORD | ** RECORD | LABEL RECORD 
(a) 
BEGINNING-TAPE| BEGINNING-FILE DATA DATA be DATA ENDING-TAPE 
LABEL RECORD | LABEL RECORD RECORD | RECORD 6 RECORD | LABEL RECORD 
BEGINNING-TAPE| DATA DATA Pea DATA ENDING-FILE ENDING-TAPE 
LABEL RECORD | RECORD | RECORD RECORD | LABEL RECORD | LABEL RECORD 
BEGINNING-TAPE | BEGINNING-FILE DATA sgn ENDING-FILE BEGINNING-FILE | DATA ete ENDING-FILE ENDING-TAPE 
LABEL RECORD | LABEL RECORD | RECORD LABEL RECORD | LABEL RECORD | RECORD LABEL RECORD | LABEL RECO! 
(c) 


Fra. 7-10. (a) Illustration of records in a file that takes up a single tape reel; (6) ill 
tration of records in a file that requires two (or more) tape reels; (c) illustration 
records in files where more than one file appears on a tape. 


contains a certain constant or word; this time, however, it is not co 
pared with the “eontents” of the data-name in the data record (give 
after IS), but rather with the sum of the contents of the data-reco 
data-names of each record of the file—i.e., the summation is taken 0 
all the records of the file under consideration. This is the meaning 
the word HASHED that appears in this second option of the clau 
Then, as each data-record is read into (out of) the high-speed memo 
the total sum is formed and at the proper time (i.e., when all the da 
records have been read) this total is compared with the label-record da 
name for the check. In this way, an omitted data-record or other su 
error can be detected. 

The “label-record descriptions” give the format of each label reco 
associated with the file under consideration, and these records appe 
after the file data. (The label records themselves appear on the file ta 
as shown in Fig. 7-10.) The following paragraph is an example of 
complete file description, which also includes (nonstandard) label-ree 
descriptions: 


FD PAYROLL-DATA; FILE CONTAINS ABOUT 1000 RECOR 
BLOCK CONTAINS 10 RECORDS; RECORD CONTAINS 70 CH 
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ACTERS; LABEL RECORDS ARE BEGINNING-FILE, ENDING- 
FILE; VALUE OF ID IS FILE-NAME; DATA RECORDS IS MASTER. 
1 BEGINNING-FILE; SIZE IS 19 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
21D; SIZE IS 3 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 REEL-NUMBER; SIZE IS 4 DIGITS; 
CLASS IS NUMERIC. 
2 JOBCLASSIFICATION; SIZE IS 6 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 LASTUPDATING;; SIZE IS 6 DIGITS. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 
1 ENDING-FILE; SIZE IS 12 DIGITS; CLASS IS NUMERIC. 
2 RECORD COUNT; SIZE IS 3 DIGITS. 
2 PAYROLL TOTAL; SIZE IS 9 DIGITS. 


The Environment Description.t| In the previous section, we considered 
how the data that enters into a problem can be described in terms of 
record formats; in this section, we have considered how a collection of 
records could be organized in a file, and how such a file can be described. 
ln the present paragraph, we shall describe how the files themselves 
win be related to the equipment available on the particular computer 
heing used—i.e., the environment description. As we noted above, the 
simplest concept is to have one file on a single magnetic-tape init an 
input file would be contributing data (records) to the high-speed memory 
for processing, and an output file would be receiving from the high-speed 
iiemory information which results from the processing. Of course, the 
input file could be on punched cards instead of magnetic tapes, the out- 
put file could be the printed results from a high-speed printer, and so 
forth. Hence, when a problem is being run, each piece of input or out- 
pul equipment (hardware) which is being used must be assigned to a 
particular file. 

The assignment of files to the input and output units (hardware) is 
wccomplished through the input-output section, which has the form 


(hardware list) :: = (hardware-name)|(hardware list), (hardware-name) 
(assignment) :: = SELECT (file name), ASSIGN TO (hardware list). 


{ In this environment description we have chosen to omit many of the alternative 


fowalbilities provided for in COBOL; this has been done for ease in understanding of 
the concepts and to eliminate unnecessary complications for the student in his first 
posure to the language, 
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(assignment list) :: = (assignment)| 
(assignment list) 
(assignment) 
(input-output section) :: = FILE CONTROL. 
(assignment-list}) 


For example, suppose that our computer has four magnetic-tape unii 
and one printer, named (hardware-names) MTU1, MTU2, MTU3, MTU: 
and PRINTER. Suppose also that, for a particular problem, we intend 
to put the file named MASTER on tape MTU1, the file named TRAN 
ACTION on tapes MTU2 and MTUS3 (that is, this file requires two tapes 
the file named NEW-MASTER on tape MTU4, and the file nam 
REPORT on the PRINTER. The input-output section would t 
appear as 


FILE CONTROL. 
SELECT MASTER, ASSIGN TO MTU1. 
SELECT TRANSACTION, ASSIGN TO MTU2, MTUSB. 
SELECT NEW-MASTER, ASSIGN TO MTU4. 
SELECT REPORT, ASSIGN TO PRINTER. 


In addition, the environment description should include some dese 
tion of the computer itself, of that part of the high-speed memory of 
computer which is to be reserved for the program in question, and 
those input-output units (hardware) which are to be utilized by 


computer. This information is specified in the configuration seelt 
which has the following form: 


{memory segment) ::= (literal) THRU (literal) 

/ memory- ) vas /memory\ | / memory- \ /memo 
\segment list/ “" ePERES> \segment/ | \segment list/’ \segmen 
/configuration\ |, 


OBJECT-COMPUTER. (computer name) 
ASSIGN OBJECT-PROGRAM TO { ha 


\ section / °° 


\ name 
MEMORY SIZE (memory-segment list). 
(hardware list). 


The term OBJECT-COMPUTER refers to the computer being used 
run the program, for example, the IBM 7090, RCA 501, Honeywell 
etc. The OBJECT-PROGRAM is the program, written in the CO 
language, which is to be run; this program itself must be stored of 
input unit. The configuration section of the program must be read 
the computer from the input unit before the computer will know W 
the rest of the program is located. The MEMORY SIZE clause 
fies the segments of the high-speed memory that will be utilized in 
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program, and the hardware list is the list of those input-output units 
that will be used in the running of the program. These should, of course, 
agree with input-output (FILE CONTROL) section. As an example, 
suppose we are to run a program on a computer called the PEDAGAC, 
that the object-program is on tape MTU5, that memory addresses 000 
(hrough 400 and 600 through 777 are to be used, and that input-output 
units MTU1, MTU2, MTU3, MTU4, and the PRINTER are to be used. 
The configuration section would then be 


OBJECT-COMPUTER. PEDAGAC 
ASSIGN OBJECT-PROGRAM TO MTU5. 


MEMORY SIZE ADDRESS 000 THRU 400, 600 THRU 777. 
MTU1, MTU2, MTU3, MTU4, PRINTER. 


With a slight stretch of the imagination, we can consider the name of 
the writer or author (programmer) of a program, as well as other infor- 
mation concerning the writing of the program, as part of the program’s 
environment. This program identification section has the following form: 


(program identification) :: = PROGRAM ID. (name)| 
(program identification) 
AUTHOR. (name)! 
(program identification) 
DATE WRITTEN. (date)| 
(program identification) 
REMARKS. (discussion). 
lor example, 
PROGRAM ID. PAYROLL 

AUTHOR. ROBERT S LEDLEY 

DATE-WRITTEN. JULY 22, 1961 

REMARKS. PAYROLL-DATA-FILE AND TOTAL-MASTER- 

FILE NEEDED AS INPUTS FOR THIS PROGRAM. 


In conclusion, the Data Division of our simplified version of COBOL 
in organized as shown in Fig. 7-11. In the actual COBOL, the file- 
wontrol and object-computer parts are included in a separate Environ- 
ment Division, the program identification is included in a separate 
Identification Division, and the Data Division includes only what we 
have called the data description. 


7-6. Data Handling by COBOL 


Orientation, So far we have considered (1) the operational statements 


of COBOL, which process the data by computing functions and carrying 
oul the vetual numerical caleulations (See, 7-2), and (2) the data-descrip- 
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tion clauses of COBOL, which define the data (as previously organized) 
on tape and reserve the proper high-speed-memory areas for input and 
output files and working storage (Sees. 7-4 and 7-5). There remain three 
further topics necessary to complete the discussion of COBOL. The first 
is the data-handling statements of COBOL, used to move or manipulate 
the data—in a manner analogous to the copying of entries into different 

























































































PROGRAM ID Program 
identification 
OBJECT COMPUTER Configuration Environment... 
section description 
FILE CONTROL Input-output 
section 
FD 
File 
description 
<label-records > 
< data-records > Data-record 
description bats 
FD description 
<label-records > | 
< data-records > | 


Fra. 7-11. Data Division of COBOL sharing only two file descriptions and oe 
record descriptions; the dashed line indicates that there are more of these if more t 
two files are used. 


columns which is so characteristic of bookkeeping; the second is the d 
editing clauses of COBOL, used to define the proper or desired form 
the data for the printing of reports, fiscal summaries, check informatio 
etc.; and the third is the jfile- and record-handling statements that are u 
to sequence the records of a file tape properly for processing. ‘I hese W 
shall now consider individually. 

Data-handling Statements. The data-handling statements are 


MOVE and MOVE CORRESPONDING 


The MOVE verb is used to transfer data from an input-file area 1 
working-storage or output-file area or from a working-storage area tO 
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output-file area, etc. The syntax of the MOVE statement is as follows: 
(sending field) :: = (data-name) 

(receiving-field list) ::= (data-name)|(receiving-field list), (data-name) 


(move statement) :: = MOVE (sending-field) TO (receiving-field list) 


The MOVE statement is analogous to a transfer instruction except that 
the sending field may be moved (transferred) into each of several differ- 
ent “locations” at once (i.e., each of the “locations” of the receiving-field 
list). For example, 


MOVE BADGE-NO IN MASTER TO BADGE-NO IN T-MASTER, 
BADGE-NO IN N-MASTER. 


This statement would move the elementary data-item with data-name 
BADGE-NO of the MASTER record of, say, an input file into the analo- 
gous data-name of the working-storage T-MASTER record and output- 
lile N-MASTER record. 

The MOVE CORRESPONDING verb is used to move a group of ele- 
mentary data-items, all at one time, into a corresponding group of new 
clementary data-item locations. The form of this variation of the MOVE 
statement is 


MOVE CORRESPONDING (sending area) TO (receiving-area list) 


Where (sending area) :: = (partial data-name) 
(receiving-area list) :: = (partial-data-name list) 


Ilere by “partial data-name”’ we mean a level name under which level 
higher levels exist. Of course the sending and receiving areas must 
include corresponding higher associated levels. For example, consider 
the two records: 


1 MASTER; SIZE IS 6 DIGITS. 
2 DATE; SIZE IS 6 DIGITS. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 3 YR; SIZE IS 2 DIGITS. 


nd suppose we wished to transfer the DATE from the MASTER record 
lo the N-MASTER record. We would write 


MOVE CORRESPONDING DATE IN MASTER 
TO DATE IN N-MASTER. 


If the data descriptions of the sending and receiving fields or areas are 


1 N-MASTER; SIZE IS 6 DIGITS. 
2 DATE; SIZE IS 6 DIGITS. 
3 MO; SIZE IS 2 DIGITS. 


iol the same, then the data-items are moved (by the MOVE verb) in 
oonformity with the description of the receiving field or area. 
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Data-editing Clauses. When business, planning, medical, and othe 
such data are handled, which characteristically require listings, sum 
maries, totals, breakdowns, and so forth, as the final data-processin 
output, the format of the final print-out is of the utmost importance 
The purpose of the data-editing clauses of COBOL is to control an 
process such formats. 

In discussing these clauses, concrete, physical pictures of the locatio 
of each character of the data must be kept in mind. In a record descri 
tion, the number of characters in an elementary data-item is specified b 
either the SIZE or the PICTURE clause: thus a particular field in t 
computer memory is specified. For example, suppose we specify t 
the elementary data-item NAME is to have 15 characters; or the pict 
may be given as 


XXXXXXXXXXXXXXX 


Then the field reserved can be diagramed as 


Pes) 


If the name is, say, JOHN JONES, which is smaller than the reser 
field, the question arises of where it will be located in the field. Ag 
if data is being moved from one field to another by means of the MO 
verb, similar problems arise when the sending and receiving fields are 
different sizes. 

When data is printed out, it is often desirable to have, for example, 
$ to the left of the leftmost digit, as in $5.95 or $25.95, etc. Clearly 
actual computations cannot be performed with the dollar sign so pla 
especially where it may not be known beforehand how many digits t 
will be in, say, a final total. Here, as in many other cases, certain 8 
bols must be appropriately placed in the final results, so that the p 
out will appear in the proper report form. 

The data-editing clauses utilized when such problems arise are 


JUSTIFIED POINT LOCATION Editing 


The JUSTIFIED clause is used when nonstandard positioning of an 
mentary data-item is desired in a field that is set up for more charaé 
than the data-item contains. The standard positioning is as follo 
For alphabetical or alphanumerical data, the data is positioned fl 
with the left end of the field, with blanks filling in any right-hand sp 


For example, in a NAME field of 15 characters, the standard positio 
of JOHN JONES would be 


[Jlo[win| [slojnwie|s] | | | | 


Numerical data for which the positioning of the decimal point has 
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specified (see below) must of course be recorded in its field properly 
positioned with respect to this point position. Other numerical data is 
positioned flush with the right end of the field, with zeros filling in any 
left-hand spaces. For example, in a field of five characters, the standard 


position of 595 is 
0/0] 5/95] 


When such standard positioning of data is not desired, the JUSTIFIED 


clause can be used in the data description, in the following alternative 
forms: 


JUSTIFIED EET 


RIGHT 


The positioning of a decimal (or binary) point in a number is given by 
(he PICTURE clause described above; the positioning of the decimal (or 


binary) point in a field in which a number is to be placed is specified by 
the POINT LOCATION clause: 


POINT LOCATION IS LEFT (integer) Sea 


Hlere “integer” gives the number of positions to the left of the rightmost! 
(lonst significant) position of the field, at which the decimal (or binary) 
point location is to be assumed. The decimal point does not actually 
ippear in the field, because of the point-location clause. 


For example, 
for a field for which 


SIZE IS 5 DIGITS; POINT LOCATION IS LEFT 2 PLACES 


we have 
oo 


assumed point location 


'o obtain an actual decimal (or binary) point character, the PICTURE 
‘lwuse must be utilized. If data is moved (by means of a MOVE state- 
fnont) from a sending field with a POINT LOCATION clause to a 
feveiving field with a PICTURE clause, then the decimal (or binary) 


joints are made compatible. For example, 


Sending field 
POINT LOCATION IS LEFT 2 PLACES 


[00/5] 9/5 
t 


assumed point location 


Receiving tield 
PICTURE IS $999.99 


= [solos]. [9]5| 
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The Editing clauses enable three kinds of “variable” editing options 
to be applied to numbers. These are written as 


CHECK PROTECT 


ZERO SUPPRESS 
FLOAT DOLLAR SIGN 


An editing option is applied to a number from left to right and sup- 
presses (or changes) the leading zeros until a nonzero digit or the decimal 
point (actual or assumed) is encountered; the option is automatically 
applied to a number as it is moved (by a MOVE verb) into a field o 
area whose data description includes such an editing option. The effect 
of each of the editing options is as follows: 

1. ZERO SUPPRESS. The leading zeros are replaced by blanks, 
For example, 


Sending field Receiving field 


ZERO SUPPRESS 
= [1 { [5/9/5) 


9. CHECK PROTECT. The leading zeros are replaced by asterisk 
For example, 


Sending field Receiving field 
PICTURE IS $999.99; 
POINT LOCATION IS LEFT 2 PLACES CHECK PROTECT 


0|0/5|9/5| = [s[*[+[5]. [915] 


3. FLOAT DOLLAR SIGN. The rightmost zero suppressed will 
replaced by the dollar sign. For example, 


Sending field Receiving field 
PICTURE IS $999.99; 


POINT LOCATION IS LEFT 2 PLACES FLOAT DOLLAR SIGN 


[olojs [915] = LL Is]5}. [915 


Note that, as illustrated by the last few examples, the PICTURE cla 
itself can accomplish some editing. The editing clauses then lend furt 
flexibility to the editing processes. 

File- and Record-handling Statements. Figure 7-5 described the pro 
by which successive records were transferred from the file tapes into 
high-speed memory for processing. However, the astute reader will ha 
observed that until now we have not described statements of COB 
which accomplish this task, or the companion task of putting proce 
information onto file tapes, or “onto’’ high-speed printers. We 4 
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describe these input-output statements: 


OPEN CLOSE READ WRITE 


Prior to the initial use of an input or output file, an OPEN statement 
must be applied to the filename. The execution of an OPEN statement 
establishes an input- or output-file storage area and performs appropri- 
ate checks on the file label records. Similarly, when no further use is 
required of an input or output file, a CLOSE statement must be applied 
to the filename. The CLOSE statement releases the input- or output- 
file area for other use, performs the appropriate closing conventions that 
may involve the label-records of the file, and usually rewinds the file- 
tape reels. A file may be closed only once for each time it is opened; 
an opened file cannot be opened again until a CLOSE statement has first 
been applied to its filename. The formats of the OPEN and CLOSE 
statements are as follows: 


(file-name list) ::= (file-name)|(file-name list), (filename) 
(in-out file-name list) :: = INPUT (filename list) 
|OUTPUT (file-name list) 


(open statement) :: = OPEN (in-out file-name list) 
(rewind options) :: = NO REWIND|LOCK 
(file-name options) :: = (file-name)|(file-name) WITH (rewind options) 
({ile-name-options list) ::= (file-name options) 

\(file-name-options list), (file-name options) 
(close statement) :;= CLOSE (file-name-options list) 


The execution of the “open statement” will apply all the opening pro- 
cedures to each file named in the file-name list. The execution of the 
“close statement” will apply all the closing procedures to each file named 
in the file-name-options list. If no “rewind options” are associated with 
a file-name, then the file is rewound automatically; if the NO REWIND 
option is used, the tape is left in its last position before the close state- 
ment was executed; if the LOCK option is used, the tape is rewound but 
upecial precautions are taken so that the tape cannot be read from or 
written upon again. 

After an “open statement” is applied to a file, it is called open until a 
close statement is applied to it, after which it is called closed. (Also, 
hefore the initial “open statement” has been executed, the file is called 
closed.) The READ and WRITE statements can be applied only to 
open files, for only these files have input or output storage areas reserved 
in the high-speed memory for their records. Each time a READ state- 
ment is applied to an input-file-name, one record is (effectively) read 
from its tape into the input storage area reserved for that file, replacing 
the previous contents of this storage area. Each time a WRITE state- 
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ment is applied to a record name, this record is written onto the appropri- 
ate output-file tape. In this way a COBOL program can be made suc- 
cessively to process one record of a file at a time. A typical sequence of 
events might be as in Fig. 7-12. 

The READ statement has the following format: 


READ (file-name) RECORD; AT END (nonconditional statement). 


When a file consists of more than one type of record, these records auto- 
matically share the same storage area, sequentially. This is equivalent 
to saying that there exists an implicit redefinition of the area, and that 


OPEN INPUT FILEA, OUTPUT FILEB 


READ a FILEA record into the FILEA storage area 


(call in record i of file A) 
Go to next record 
(let i+ 1—>i, j7+1—>/) 
WRITE a record onto FILEB from the 
FILEB storage area (write record 7 onto file B) 
Have all records of 
FILEA been processed? 
No Yes 
CLOSE FILEA, FILEB 


Fia. 7-12. Typical loop for sequencing the processing of the records of a file. 


























Process and manipulate 
the data (process record i 
producing record 7 ) 






only the information which is present in the current record, just “read in” 
by means of the most recent READ instruction, is available. No refers 
ence in the processing of the record (by the procedure division) can bé 
made to information which is not actually present in the current record, 
When all the records have been READ from a file, the end-of-file label 
record is sensed and the ‘‘nonconditional statement” following AT END 
is executed. For instance, we might have 


READ FILEA RECORD; AT END GO TO LABELA. 
Here, after the last record of FILEA has been processed, the progr 
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will jump to the subroutine labeled LABELA, which provides the exit 
for the processing loop. 
The WRITE statement has this format: 
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(from clause) :: = FROM (record-name) 

/advancing\ _._ | AFTER (data-name) LINES 
alae ors ADVANCING | (integer) LINES 

fe Re ages WRITE (record-name) [(from clause)] [< anne) | 

\statement/ *’ \ clause 


The ‘“record-name”’ following WRITE must be the name of a record in 
the output-file storage area, which thus specifies which file is to be written 
upon. This record is then written out onto its associated file each time 
the WRITE statement is executed. The [square] brackets indicate two 
possible options, either or both of which may be included or omitted. 
If the record to be written out is not in an output-file area, but is in an 
input-file area or working-storage area, then the “from clause’’ is used, 
with the name of the desired record in these other areas included in the 
“from clause.”? The effect of using the “from clause” is to MOVE the 
record named in the clause automatically to the record named after 
WRITE, and then to write this onto the associated file. The “advanc- 
ing clause” is used only when the output file is actually to be a print-out 
on a high-speed printer. 

A Complete Example. At the end of this section we present a complete 
example of a simplified COBOL program to do a monthly payroll. Three 
files are involved, the initial payroll information file, called PAYROLL- 
INFO-FILE, the updated payroll file to be used the following month, 
culled NEW-PAYROLL-INFO-FILE, and the paycheck data file, called 
PAYCHECK-FILE. For simplicity it is assumed that each employee is 
paid on a regular monthly salary rate. ach month the Social Security 
und Federal income-tax withholdings are deducted from this monthly 
sulary rate, according to the following formulas: 

1. Monthly Social Security is 0.031 X salary rate, until the accumu- 
lated payment has reached $148.80, after which no further payments 
(deductions) are made. 

2. Monthly income-tax withholding = 0.18 X (salary rate — N X 56), 
where N is the number of dependents claimed. Of course the tax with- 
holding cannot be negative. 

The amount of the monthly paycheck is equal to the salary rate less 
(hese two deductions. In addition, the new payroll file must be updated 
with the total income tax withheld to date and the total Social Security 
withheld to date, as well as the total gross salary earned to date. The 
student should study the example carefully to see how each of these items 
in wecomplished, 
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In a more realistic payroll problem, the occurrence of a new employee, 
the dropping of an employee, the changing of the salary rate, or the 
number of dependents should also be taken into account (see Exercise 
7-42). In addition, hourly wages and part-time employment must also 
be handled (see Exercise 7-44). Other withholding items, such as state 
income tax, retirement and insurance, regular savings or bond buying, 
and so forth, must often be considered (see Exercises 7-43 and 7-45). 
Finally, the employer’s monthly deposit of withholdings must be com- 
puted, and the quarterly form 941, annual forms 940 and W-3, and the 
W-2 form for each employee must all be computed. Thus it becomes 
clear that even such an apparently straightforward data-processing prob- 
lem can become quite complex. The student will gain valuable experi- 
ence by trying the exercises mentioned. 
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DATA DIVISION. 


PROGRAM ID PAYROLL. 
AUTHOR. ROBERT S. LEDLEY 
DATE WRITTEN. JULY 22, 1961 
REMARKS. PAYROLL-INFO-FILE IS INPUT, PAYCHECK- 
FILE AND NEW-PAYROLL-FILE ARE OUTPUTS, NO 
CHANGES IN PAYROLL DATA LIST ARE INCLUDED. 


OBJECT COMPUTER. PEDAGAC 
ASSIGN OBJECT-PROGRAM TO MTU5. 
MEMORY SIZE ADDRESS 000 THRU 1000. 
MTU1, MTU2, PRINTER. 


FILE CONTROL. 
SELECT PAYROLL-INFO-FILE, ASSIGN TO MTU1. 
SELECT NEW-PAYROLL-INFO-FILE, ASSIGN TO MTU2. 
SELECT PAYCHECK-FILE, ASSIGN TO PRINTER. 


FD PAYROLL-INFO-FILE; FILE CONTAINS ABOUT 1000 RECORDS 
BLOCK CONTAINS 10 RECORDS; RECORD CONTAINS 
CHARACTERS; LABEL RECORDS ARE BEGINNING-FIL 
ENDING-FILE; VALUE OF ID IS PIF; DATA RECORDS i 
PAYROLL. 

1 BEGINNING -FILE; SIZE IS 9 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 ID; SIZE IS 3 CHARACTERS; CLASS IS ALPHABETI 
2 LASTUPDATING;; SIZE IS 6 DIGITS. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 
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1 ENDING-FILE; SIZE IS 3 DIGITS; CLASS IS NUMERIC. 
2 BLOCK COUNT; SIZE IS 3 DIGITS. 
1 PAYROLL; SIZE IS 50 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 NAME; SIZE IS 12 CHARACTERS; 
CLASS IS ALPHABETIC. 
3 LAST; SIZE IS 10 CHARACTERS. 
3 INITIALS; SIZE IS 2 CHARACTERS. 
2 SSNO; SIZE IS 9 DIGITS. 
2 SALARY-RATE; PICTURE IS 9999.99. 
2 EXEMPTIONS; SIZE IS 1 DIGIT. 
2 GROSS-TO-DATE; PICTURE IS 99999.99. 
2 SSWHLD-TO-DATE; PICTURE IS 999.99. 
2 FED-TAX-WHLD-TO-DATE; PICTURE IS 9999.99. 


FD PAYCHECK-FILE; FILE CONTAINS ABOUT 1000 RECORDS; 
BLOCK CONTAINS 10 RECORDS; RECORD CONTAINS 42 
CHARACTERS; LABEL RECORDS OMITTED; DATA RECORDS 
IS CHECK. 

1 CHECK; SIZE IS 42 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 NAME; SIZE IS 12 CHARACTERS; 
CLASS IS ALPHABETIC. 
3 LAST; SIZE IS 10 CHARACTERS. 
3 INITIALS; SIZE IS 2 CHARACTERS. 
2 GROSS-PAY; PICTURE IS $9999.99. 
2 DEDUCTIONS. 
3 SS; PICTURE IS $999.99; FLOAT DOLLAR SIGN. 
3 FED-TAX; PICTURE IS $999.99; 
FLOAT DOLLAR SIGN. 
2 AMOUNT; PICTURE IS $9999.99; CHECK PROTECT. 


NEW-PAYROLL-INFO-FILE; FILE CONTAINS ABOUT 1000 
RECORDS; BLOCK CONTAINS 10 RECORDS; RECORD CON 
TAINS 50 CHARACTERS; LABEL RECORDS ARE BEGINNING- 
FILE; ENDING-FILE; VALUE OF ID IS NPF; DATA RECORDS 
IS NEW-PAYROLL. 
1 BEGINNING-FILE; SIZE IS 9 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 ID; SIZE IS 3 CHARACTERS; CLASS IS ALPHABETIC. 
2 DATE; SIZE IS 6 DIGITS. 
3 MO; SIZE IS 2 DIGITS. 
3 DA; SIZE IS 2 DIGITS. 
3 YR; SIZE IS 2 DIGITS. 
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1 ENDING-FILE; SIZE IS 3 DIGITS; CLASS IS NUMERIC. 
2 BLOCK COUNT; SIZE IS 3 DIGITS. 
1 NEW-PAYROLL; SIZE IS 50 CHARACTERS; 
CLASS IS ALPHANUMERIC. 
2 NAME; SIZE IS 12 CHARACTERS; 
CLASS IS ALPHABETIC. 
3 LAST; SIZE IS 10 CHARACTERS. 
3 INITIALS; SIZE IS 2 CHARACTERS. 
2 SSNO; SIZE IS 9 CHARACTERS. 
2 SALARY-RATE; PICTURE IS 9999.99. 
2 EXEMPTIONS; SIZE IS 1 DIGIT. 
2 GROSS-TO-DATE; PICTURE IS 99999.99. 
2 SS WHLD-TO-DATE; PICTURE IS 999.99. 
2 FED-TAX-WHLD-TO-DATE; PICTURE IS 9999.99. 


WORKING STORAGE. 
77 SOC-TEMP; PICTURE IS 999.99. 
77 TAX-TEMP; PICTURE IS 9999.99. 
77 SS-RATE; SIZE IS 3 CHARACTERS; CLASS IS NUMERIC; 
POINT LOCATION IS LEFT 3 PLACES; VALUE IS 031. 
77 PIF; SIZE IS 3 CHARACTERS; CLASS IS ALPHABETIC. 
77 NPF; SIZE IS 3 CHARACTERS; CLASS IS ALPHABETIC. 
77 ZERO; PICTURE IS 9999.99; VALUE IS 0000.00. 


PROCEDURE DIVISION. 


HOUSEKEEPING-OPEN. 
OPEN INPUT PAYROLL-INPUT-FILE, OUTPUT PAYCHECK- 
FILE, NEW-PAYROLL-INFO-FILE. READ PAYROLL-INPUT 
FILE RECORD; AT END GO TO HOUSEKEEPING-CLOSE.} 


UPDATING-SSWHLD-AND-FED-TAX. 
COMPUTE SOC-TEMP FROM SS-RATE TIMES SALARY-RATE 
IN PAYROLL. IF SOC-TEMP IS GREATER THAN 148.80 
MINUS SSWHLD-TO-DATE IN PAYROLL THEN MOVE 148.80 
TO SSWHLD-TO-DATE IN NEW-PAYROLL; COMPUTE SS IN 
DEDUCTIONS FROM 148.80 MINUS SSWHLD-TO-DATE IN 
PAYROLL ELSE MOVE SOC-TEMP TO SS IN DEDUCTIONS}; 
COMPUTE SSWHLD-TO-DATE IN NEW-PAYROLL FROM 
SOC-TEMP PLUS SSWHLD-TO-DATE IN PAYROLL. COM 
PUTE TAX-TEMP FROM 0.18 x (SALARY-RATE IN PAYROLL 
MINUS EXEMPTIONS TIMES 656.00). IF TAX-TEMP I§ 
GREATER THAN ZERO THEN MOVE TAX-TEMP TO FED=« 


} AT END clause is required by the syntax, even though the program never returnit 
to this point. 
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TAX IN DEDUCTIONS; COMPUTE FED-TAX-WHLD-TO-DATE 
IN NEW-PAYROLL FROM TAX-TEMP PLUS FED-TAX-WHLD- 
TO-DATE IN PAYROLL ELSE MOVE ZERO TO FED-TAX IN 
DEDUCTIONS; MOVE FED-TAX-WHLD-TO-DATE IN PAY 
ROLL TO FED-TAX-WHLD-TO-DATE IN NEW-PAYROLL. 


GROSS-PAY-AND-AMOUNT. 
COMPUTE GROSS-PAY IN CHECK FROM SALARY-RATE. 
COMPUTE AMOUNT IN CHECK FROM GROSS-PAY MINUS 
SS MINUS FED-TAX. COMPUTE GROSS-TO-DATE IN NEW- 
PAYROLL FROM GROSS-TO-DATE IN PAYROLL PLUS 
SALARY-RATE. 


OUTPUT-FILE-MAKEUP. 
MOVE CORRESPONDING NAME IN PAYROLL TO NAME IN 
CHECK, NAME IN NEW-PAYROLL. MOVE SSNO IN PAY 
ROLL TO SSNO IN NEW-PAYROLL. MOVE SALARY-RATE 
IN PAYROLL TO SALARY-RATE IN NEW-PAYROLL. MOVE 
EXEMPTIONS IN PAYROLL TO EXEMPTIONS IN NEW- 
PAYROLL. 


ITERATION-AND-EXIT. 
WRITE NEW-PAYROLL. WRITE CHECK BEFORE ADVANC 
ING 5 LINES. READ PAYROLL-INFO-FILE RECORD; AT 
END GO TO HOUSEKEEPING-CLOSE. GO TO UPDATING- 
SSWHLD-AND-FED-TAX. 


HOUSEKEEPING-CLOSE. 


CLOSE PAYROLL-INFO-FILE, NEW-PAYROLL-INFO-FILE 
WITH LOCK, PAYCHECK FILE. 


7-7. Exercises 


7-1. Write the syntactical description of a Boolean expression for COBOL which 
nutomatically preserves the precedence order of the operations, 


If the value of the conditional variable SALE is CHAIR, INVOICE is BED, SALES 
MAN is JONES, and CUSTOMER is LEE, and if the value of the condition-name 
MERCHANDISE is “BED”, BILL is “CHAIR”, BUYER is “LEE”, and SELLER is 
“JONES”, then determine the truth value of the following Boolean expressions: 

7-2. SALE IS EQUAL TO MERCHANDISE AND SALESMAN IS EQUAL TO 
SELLER 

1-38. SALE IS EQUAL TO “CHAIR” AND CUSTOMER IS NOT EQUAL TO 
SELLER 

T-4. INVOICE IS LESS THAN “CHAIR” OR CUSTOMER IS LESS THAN 
SELLER 


1-6. SALE IS EQUAL TO “BED” AND SALESMAN IS EQUAL TO “JONES” 
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Complete the omissions in the Boolean expressions involved in the following 


ntences: 
€ 7-6. IF AIS EQUAL TO B THEN ADD X TO Y ELSE IF GREATER THAN C 


THEN ADD M TO N ELSE S1. 
7-7. IF AIS EQUAL TO B THEN ADD X TO Y ELSE IF GREATER THAN ADD 


M TO N ELSE ADD P TO Q ELSE S82. 
7-8, IF A IS GREATER THAN B OR EQUAL TO B ape ee TOY 
THEN IF GREATER THAN B THEN ADD C TO D ELS ‘ 
7-9. IF A IS EQUAL TO B AND X IS GREATER THAN Same ADD C TOD 
ELSE S2. 
ELSE IF EQUAL TO THEN ADD P TO Q ELSE S1 
7-10. IF A IS EQUAL TO B AND X IS GREATER THAN Y THEN ADDC TOD 
ELSE IF EQUAL TO THEN ADD P TO Q ELSE S1 ELSE S2. 


7-11. Make flow charts for the sentences of Exercises 7-6 and 7-7. 

7-12. Make flow charts for the sentences of Exercises 7-8, 7-9, and 7-10. 

7-13. Write the syntactical description of an arithmetic expression for COBOL, 
which automatically preserves the precedence order of the operators. 

7-14. Explain why a period is required at the end of a conditional statement a 
an else fragment follows. (Hint: Give examples of ambiguities that can result 
the period is not used.) 


Evaluate the following statements, where initially A = 3.7, B = 6.2, C = 0.0: 

and D = 10, and give the data-name of the heap ee ae 
- UTE PRICE FROM C TIM i 

aie SUAEOTE XX6P FROM A PLUS B TIMES C DIVIDED BY D MINUS 

7-17. ADD A, B, D GIVING TOTAL; SUBTRACT A, B FROM TOTAL. 

7-18. ADD A, B, C; SUBTRACT A, C FROM D; MULTIPLY D BY C. 

7-19. MULTIPLY A BY B; DIVIDE B INTO D. 

7-20. MULTIPLY A BY B; DIVIDE D INTO B. 


Write statements that compute the following: 

7-21. The price of 51 items at $2.75 per dozen, where we get a 25 per cent discoun 

7-22. The retail price of an item the wholesale cost of which is $70 per dozen wh 
it is desired to make a 20 per cent profit. 

7-23. The take-home pay of an employee who works 23 hr at $2.50 per hour 
65 hr at $2.75 per hour if he must contribute 5 per cent of his earnings to insuran 

7-24. min(A? — BC, A — B/C). 

7-25. min{max(A,B,C), max(D,E,F)]. 


7-26. Write a code to compute amortization tables. 


7-27. Write a code to compute WN. ; al 
7-28. Define a new statement format that calculates a price after taking a discow 


7-29. Write a code that computes a table of retail prices as a function of per ¢ 


profit and wholesale prices. ae 
7-30. Define a new statement format that calculates the retail price from 


sale price and per cent profit. ; 
7-31. Define a new statement format that gives an amortized monthly paym 


for a given mortgage principle, interest, and years to pay. 


a wh 


Write codes to evaluate the following, including the data-descriptions involy 


the OCCURS clause that may be required: 
7-32. Trace T of a matrix A(i,j) where 


n 


T= d AG, i) 
#1 
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7-33. Transpose of a matrix where 


A‘G,j) = AG,i) 


The element of a matrix with the greatest absolute value. 
An element of a table by the method of Fig. 3-11. 
The scalar product of two vectors. 


7-34. 
7-35. 
7-36. 


7-87. Prepare the record description that generates a table giving the prices of five 
different styles of women’s dresses, each of which comes in six different qualities of 
cloth, each of which occurs in four different colors. The table is to be accessed by the 
style-name, cloth-name, and color-name, with a subscripted variable. 

7-88. In Exercise 7-37 suppose that two of the styles were designed by DIOR and 
the other three by CASSINI; also suppose that three of the cloths were made by 
WILSON-MILLS and the other three by BARKER-MILLS; write a REDEFINE 
clause that will enable the dress price table also to be referenced by designer, cloth 
manufacturer, and color with a subscripted variable. 

7-39. Prepare the record description for an income-tax withholding table. 

7-40. Prepare the record description for a mortgage-amortization table. 

7-41. Identify the parts of the example of FD PAYROLL-DATA in See. 7-5. 

7-42. In the complete example of a payroll given at the end of Sec. 7-6 we neglected 
vonsideration of new employees, dropped employees, change of number of exemptions 
tlaimed, and change of salary rate. Redo the complete example to include these 
points. (Hinr: Include an additional input data-file called NEW-DATA; assume 
(that the PAYROLL-INFO-FILE will have the EMPLOYEE-records on the tape in 
ilphabetical order by last name; assume the same is true for the employee records 
(hat will be on the NEW-DATA file. Be careful that the final check is written for 
(he dropped employees.) 

7-43. Redo the complete payroll example of Exercise 7-42 including an additional 
output tape file upon which is calculated the employer’s monthly deposit of income- 
(nx withholding, the quarterly form 941, and annual forms 940, W-3, and the annual 
form W-2 for each employee. (Hint: Obtain the IRS employer’s instructions for 
reference. Note that the dropped employees must be given W-2 forms at the end of 
(he year during which they were employed. To take care of this case, use a level 88 
ilnuse to identify active and inactive EMPLOYEE records. Also use the Editing 
ilnuses to fill out the forms properly.) 

7-44. In Exercise 7-43 rewrite the payroll example including provision for hourly 
Wages and part-time employees as well. 

7-45. Include in Exercise 7-43 provision for withholding state income tax, medical 
iimurance, and retirement. 

7-46. Write a program in COBOL language for the processes illustrated in Fig. 1-12, 
(he inventory-control system. (This problem will require the student first to define 
(he ingredients involved more precisely.) 
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CHAPTER 8 


PROGRAMMING TO ACHIEVE INTELLIGENCE 


8-1. Advanced Concepts in Automatic Programming 


Achieving Intelligence. The purpose of this chapter is to introduce 
xome of the advanced concepts in automatic programming, with particu- 
lur emphasis on those aspects that deal with programming to achieve 
artificial intelligence. When we discuss programming methods intended 
(o impart a measure of “intelligence” to a computer, we do not mean the 
simulation of actual mental processes involved, but rather their rational 
reconstruction in an attempt to mimic their outputs or their structures. 
The goal is to determine effective programming techniques for accom- 
plishing “humanoid’’t tasks, imitating the human mental processes only 
when this proves to be the most effective way to do the job. 

Syntactical techniques prove to be of great value in programming to 
achieve intelligence. Therefore we devote the next two sections to a 
discussion of syntactical analysis, in particular as applied to automatic- 
programming-language translation. Besides presenting a more detailed 
study of the syntactical structures themselves, this discussion is of 
great importance in its own right for its consideration of automatic- 
programming-translation methods. In Secs. 8-2 and 8-3 we shall see 
how a single translator program can be utilized to translate any auto- 
matic program into a machine language, provided the automatic-pro- 
“ramming language is suitably syntactically defined. 

Next, the discussion turns to the problems of deductive and inductive 
Inference. Deductive inference is the process of drawing consequences or 
vonclusions from given premises, while inductive inference is the process 
of attempting to determine hypotheses or theories from which the given 
propositions may be deduced (see Sec. 10-13). Methods of inductive 
inference—namely, the so-called “heuristic programming’’—have received 
recent prominence as being associated with models of intelligent behavior. 
Ilowever, these developments should not overshadow the more “conven- 
ional’? mathematical methods in the area of artificial intelligence. These 
‘eonventional”” methods, such as logic, probability, linear and dynamic 





{ We borrow this term from H, A, Simon, See his Modeling Human Mental 
lrovessoa, Proc, Wealern Joint Computer Conf., May, 1961, pp. 111-119. 
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programming, game theory, etc., are associated with deductive inference 
and are the topic of Secs. 8-4 and 8-5. In Sec. 8-5 we apply such mathe- 
matical methods of deductive inference to formulating the humanoid task 
of making a medical diagnosis, interpreting in quantitative terms many 
of the decisions the physician now makes in a qualitative form. 

Having emphasized the more conservative mathematical approach, 
we can now turn, with a clear conscience, to the problems of inductive 
inference and the newer techniques of heuristic programming. The con- 
cept is introduced in Sec. 8-6; and the so-called ‘general problem solver” 
is illustrated in Sec. 8-7. Heuristic programming was developed in order 
to study the manner in which human problem solving is accomplished, 
such as the playing of games or the proving of geometric theorems or 
trigonometric identities. This field holds great promise for forming the 
basis of techniques that may enable truly intelligent problem solving by 
computers. 

The final section of this chapter is devoted to the consideration of 
abstraction and creativity. The problem of programming computers to 
conceive an abstraction will be approached through the use of syntactical 
analysis coupled with the methods of deductive and inductive inference, 
There are two closely related aspects to the problem—that of recognizing 
an object as a concrete example of an abstraction, and that of learning, 
or formulating abstract concepts from the characteristics of a given popu 
lation. The problem has not yet been solved, of course, but we hope to 
create an awareness of its existence and to present at least a glimmer 
possible promising methods of approach. 

The most human of humanoid tasks is perhaps that of creativity 
hence we end the chapter (Sec. 8-8) with a short discussion of an examp 
of creating a musical composition by a computer and a discussion of thi 
problems that confront the programmer if true creativity through com 
puter programming is to be achieved. In presenting this material, t 
author must act as a reporter describing current research approaches 
attempting to stimulate the reader to delve further into these new at 

untried areas on the research frontiers. 

List Processing. Programming to achieve intelligence characterist 
cally involves extensive symbol manipulation, and a technique is require 
for handling structures that evolve during the course of processing. If 
these purposes, automatic list-processing programs have been developed 
The threaded-list and push-down-list concepts introduced in Sec. 5-5 
the basic ingredients of such an automatic-programming language. It 
appropriate to make some further remarks here about list processitt 
since it will be understood throughout the rest of this chapter that uv 
type structures will be handled through list processing wherever required, 

As a brief illustration of how a list-processing type of language work 
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consider the list element of Fig. 5-8 (page 178) and call the three com- 
ponents of an element the link, the type or branch link, and the character, 
this latter being B, C, or E, as follows: 


Type or 
branch 
Link link Character 


List-element format [TT 


We may let the entire memory be composed of lists. The instruction list, 
having elements (i.e., instructions) of a different format from the rest 
of the data, will be processed separately. The unused memory words 
will comprise a push-down list. When the address of a list (i.e., the 
address of the first element of a list or sublist) starts with P, it is auto- 
matically understood to be a push-down list. 

Although we cannot here develop a full list-processing language, we 
shall give examples of the kinds of instructions required in such a lan- 
guage, illustrated by the manipulation of the lists of cards associated 
with the well-known game of rummy. The original deck of cards, 
assumed to be already shuffled, forms a push-down list, called the stock, 
to which we give the address PSTK. Suppose there are four players and 
each is dealt seven cards in a hand; let the addresses of the hand lists be 
I1A1l, HA2, HA3, and HA4. Then, in order to deal, we would need an 
instruction of the type 


Move element PSTK to end of list HAT 


Of course, the next element of PSTK would pop up. Initially J = 1, 
but I would be increased by 1 for each repeat of this instruction (i.e., 
the cards would be dealt one at a time to each successive player) until 
| = 4, when J would be reset to 1 again. When 28 cards have been dealt, 
the 29th card would be turned up to start the discard list, of address 
PDSC, by an instruction 


Move element PSTK to front of list PDSC 


Note that an element within a push-down list would never be directly 
addressed, by the very nature of the push-down-list concept. Now we 
are ready for the first play. The first player decides whether to pick a 
card from the diseard list or the stock list. If, for instance, he desires to 
choose the card from the discard list, an instruction of this type would 
be used: 

Move element PDSC to front of list HAJ 


Then the player examines his hand to see whether or not he has three or 
four of a kind, or a run of three or more in the same suit. This would be 
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accomplished by instructions of the kind 


AUTOMATIC-PROGRAMMING LANGUAGES 




















Compare the type of the 7th element of list HAJ with the 
type of the jth element of list HAT 


If three or four of a kind, or a run of three or more, are found, then a ne 
list is initiated, called a spread list, SPJ, with an instruction of the for 


Move elements 7 through j of list HAJ to front of list SPJ 


where J will be given a different value of each spread. The second play 
would go through the same routine, etc. When a player has no car 
remaining in his hand he stops the game and counts his winning point 
Hence we would need an instruction of the type 


Find the number of elements in the list HAJ 


When looking for three or four of a kind or for runs of three or mo 
a reshuftling of a hand may be convenient. This would require instru: 
tions of the type 


In list HAJ find the address of element in list with type T 
In list HAJ remove element 7 and insert it after element 7 


Note that, if a symbolic machine-language instruction type W 
desired, it would be natural in our case to have a two-address instructi 
format of the form 

a B Operation 


ey a ee (Saar Sees 


However, some of the instructions described above have up to f 
addresses. This could be taken care of by having several index re 
ters to store the extra required addresses temporarily. Of course 
address-assigner routine of the type described in Sec. 5-7, in combinat 
with a simulational routine of the type described in Sec. 5-4, would 
required to compile or interpret such two-address symbolic machi 
language instructions. 


8-2. Automatic-programming-language Translation 


Syntax-directed Automatic-programming-language Translation, 
Chap. 5 we briefly described some methods that could be used to com 
automatic-language “pseudoinstructions’”’ in terms of machine-langu 
instructions. For each pseudoinstruction of the automatic language, 
subroutine is written that can interpret, or “translate,” the pse 
instruction into a sequence of machine-language instructions. In 
section, on the other hand, we shall describe an entirely different 
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cept for automatic-programming-language translation,+ which requires 
only a single subroutine to perform the entire translation. The differ- 
ent. pseudoinstructions are characterized by their respective syntactical 
and semantic descriptions, and the single subroutine is designed to 
translate any such syntactical and semantic description into the machine- 
language instructions. 

The advantage of such a concept is that changes and modifications in 
the automatic-programming language can be made without regard to the 
machine-language-translation problem, for such changes would involve 
the alteration of only the syntactical and semantic description of the 
language. In fact, the same translator program would be able to trans- 
late many different automatic-programming languages (that are suitably 
defined) into machine language. Also, the translator program can trans- 
late from one automatic-programming language to another automatic- 
programming language. This latter consideration would be of impor- 
tunce when, for example, it is desired to translate a general language 
such as ALGOL or COBOL into a specific automatic language, which 
may have been devised by the computer manufacturer, etc. Hence the 
method offers a possible solution to the burgeoning ‘tower-of-Babel”’ 
phenomenon that is occurring in the automatic-programming-language 
licld (see Fig. 8-1). 

To describe this new approach, it will be convenient to define a few 
new terms for old ideas. Language translation is said to take a source 
language into a target language.t In our case, the source language is the 
original automatic-programming language, and the target language is 
the machine language (or other automatic-programming language) in 
terms of which the final translated result is to appear. By the definition 
of a language we mean the syntactical and semantic description of the 
language. Note that the definition of a language must be stated in terms 
of some other language. For example, our definitions of ALGOL and 
COBOL of the previous chapters were stated in terms of English. How- 
over, for language-translation purposes, the definition of the source language 
must be given in terms of the target language. In fact, much of the burden 
of the success of the translation process is placed on the accuracy and 
vompleteness with which these automatic-language syntactical and 
womantie descriptions are made. The translator is the program, written 
in a metalanguage, which operates only on syntactical and semantic 
structures. 


{ The methed described is based on a concept of E. T. Irons; see E. T. Irons, A 
Myntax Directed Compiler for ALGOL 60, Communs, ACM, vol. 4, no. 1, pp. 51-58, 
January, 1961, 

{ If the target language is the final machine-language code, the term object language 
in frequently used, 
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Symbolically, we can represent the situation as follows: Let $(8) and 
3(&) be the definitions of the source language and target language, 
respectively, in terms of English, such as we have given in previous 
chapters for ALGOL and COBOL. For our translation method, we 
really desire the definition §(3)—that is, the definition of the source 
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Fie. 8-1. “Tower of Babel,’’ illustrating the programming-language-translation pr 
lem. The expressions in the blocks of the tower are names of just a few of the exist 
automatic-programming languages. (Adapted from the January, 1961, cover 
Communs. ACM, vol. 4, no. 1, with permission.) 


language in terms of the target language—where 3 = 3(&) is given by 
say, the computer manufacturer’s programming manual. Now t 
translator can be considered as a transformation 7’ such that T($(3)| = 
with the property that, if @($) is a program written in the source langu 
then T[(3)] = P[T(8)] = &(3) [where (5) is the translated program 
terms of the target language]. Thus the translator operates by “looking 


Src. 8-2] PROGRAMMING TO ACHIEVE INTELLIGENCE 301 


the successive symbols of a program written in the source language, comparing 
these symbols with those of the definition of the source language, and trans- 
forming them into symbols of the target language, according to the syntax 
and semantics of the description (see Fig. 8-2). 











Metalanguage description §(3) 


Program 0@(§) writ- 
of the source language § in -+| Translator fe ten in the source 
terms of the target language 3 


language § 


Translated program @(3) in 


terms of target language 3 





Fia. 8-2. Use of translator: T{[§(3)]} = ®{T[§@)]} = (3). 


The Syntactical and Semantic Descriptions. We shall introduce the 
notion of the syntactical and semantic descriptions by means of the 


- 
PaBLe 8-1. SYNTAX AND SEMANTICS FOR TRANSLATING ENGLISH INTO KoREANT 


Syntaz Semantics 
cto 
1 (noun) :: = boy|girl =s 
Ap Ar 
2 (adjective) :: = tall|short =: |x 
eu 
/definite\ |. 
\ article /** ne = (empty) 
ea g 
4 (verb) :: = loves => st 
+ 
al 
a definite 
5 (n-phrase) :: = (noun) | Gece )inephrase) => {Ci} | {Ci} 
. . C 
Kadjective)(n-phrase) ri 
{C1} 
(Ci 
6 (object phrase) :: = (verb)(n-phrase) => = 
{C2} 
7 (subject) :: = (n-phrase) => CF 


8 (sentence) :: = (subject)(object phrase) ae aH 
1 


} Growsly simplified for illustrative purposes, 
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example of translating English as the source language into Korean as the 
target language. It is important to observe, however, that, even though 
we are using a written-language translation as an example, it can be 
shown that the method to be presented is not sufficient for actual human- 
language translation, although clearly the method is one of the ingredients 
of that process. Indeed, the illustration is intended only to emphasize 
the important aspects of automatic-programming-language translation, 
The syntactical definitions given in Table 8-1 indicate the grammatic 
parts required to translate the sentence 


the tall boy loves the short girl 


The metalanguage notation is the same as that used in the descriptio 
of ALGOL and COBOL: e.g., the first line says that a ‘‘noun” can 
either boy or girl; the fifth line states that an ‘‘n-phrase” (noun phrase 
can be an adjective followed by an n-phrase, or a definite article follow 
by an n-phrase, or a noun. Thus we could analyze (or diagram) t 
sentence as follows, according to the recursive use of these rules 
syntax: 











the tall boy loves the short girl 
___fJ,_ 4], _4] ___ 4 
( deGnite\ adjective) (noun) (verb) { defn ite\ (adjective) (noun) 
\ article / \ article / 
(n-phrase) 
(n-phrase) 
(6. 
(n-phrase) (n-phrase) 
(subject) (object phrase) 





(sentence) 


Although we have used this process in Chap. 6, let us observe the sit 
cessive steps more closely (see numbers on the brackets): (1) Choosi 
the first word of the sentence, namely, the, we look on the right-hand 
of the syntax of Table 8-1 to identify its syntactical form; the lines 
Table 8-1 are observed successively from the top down; line 3 indie 
that the is a “definite article.” The next step is to determine whether 
not a “definite article’ can be developed as a part of a higher fo 
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starting again from line 3 of the syntax and working down. Line 5 has 
the term as part of the second alternative from the right—if an n-phrase 
follows the, then we can incorporate it into a larger n-phrase. (2) To 
see whether or not this development can be made, we choose the next 
word of our sentence, namely, tall, and establish it as an adjective; the 
first (rightmost) alternative of line 5 could be used to incorporate tall 
into a larger form if we only had an n-phrase to its right. Hence (3) 
we go on to the next word of the sentence, namely, boy. By syntax 
line 1, this is a noun; by the third alternative from the right on line 5, we 
lind (4) that the noun can be called an n-phrase; again by syntax line 5, 
first alternative on the right, we can (5) combine our adjective (tall) 
with the following n-phrase (boy) to form a larger n-phrase, and so forth. 
Note that we could not use line 7 yet, because we must use the syntax 
rules in precedence order from top to bottom, starting each time at the 
line at which the previous syntax form was completed. Hence we can 
use line 7 only when we have exhausted all possibilities of previous occur- 
rences of an n-phrase in a syntactical rule. This occurs after we have 
inade the tall boy an n-phrase, because this n-phrase is not preceded by 
wnother adjective or a definite article (as required by line 5) or by a verb 
(us required by line 6). 

Mach syntax line of Table 8-1 has an associated semantic part, to the 
right of the =, giving the Korean description of the corresponding syntax 
alternatives. For instance, the semantics of line 1 of Table 8-1 gives the 
IXorean words for boy and for girl; line 3 states that Korean has no definite 
urticle. The semantic descriptions of lines 5 through 8 require further 
explanation: the {C,} stands for the Korean translation (or target- 
language code) of the corresponding grammatical form—i.e., of the word 
(that the corresponding grammatical form represents at the time of inter- 
rogation. For example, consider the noun boy; in using line 5 to form an 
n-phrase of this noun, the corresponding {C,} represents the Korean 
\ranslation of boy (see line 1). The reason for the subscript is that fre- 
uently more than one term or component enters into a syntax alter- 
itive, e.g., (adjective)(n-phrase) in line 5 of Table 8-1. To distinguish 
hetween the translations of each of these components, we number each 
component of an alternative from right to left and index the {C,,} accord- 
ingly. For example, consider the first (rightmost) alternative of line 5, 
\ogether with its corresponding semantic description. Korean is written 
in columns (like Chinese) rather than in rows (as English is). The 
wemanties says that an adjective followed by an n-phrase is translated 
into Korean with (the Korean translation of) the adjective on top, the 
xymbol & under this, and then (the Korean translation of) the n-phrase 
under that, In Fig, 8-3 we have completed the translation of our sentence 

ncoording to the syntactical analysis described above, in conjunction with 
the semantic descriptions given in Table 8-1,. Mor a thorough under- 

















shown in Table 8-2. 


TABLE 8-2. LEVEL-NUMBER REPRESENTATION OF THE Syntax oF TABLE 8-1 


AUTOMATIC-PROGRAMMING LANGUAGES 


standing of this relatively simple process, the student should follow in 
detail each of the steps of Fig. 8-3 as indicated by the numbers on the 


Note that in Table 8-1 we have numbered the lines, or ‘“Jevels,”’ ascrib- 
ing “level numbers” to the terms on the left of the syntax descriptions. 
These level numbers can entirely replace the terms in such definitions, as 





(sentence) 
The 


Above the horizontal 


The boldface words can be considered as “literals,” 


since they are not generic entities but are actual words of the sour 
The use of level numbers assists the analysis, provided we 
assign the level number ‘‘0” to the literals. 
manner entirely equivalent to that described above, where now we thin 
of each symbol or combination of symbols in the sentence as being su 
cessively replaced by one with a higher level number in accordance with th 
syntax rules, in an attempt to obtain the highest level number which wi 
Of course, each time a replacement is ma 
the corresponding semantic descriptions, or translations, are aligned, 


illustrated for our example in Fig. 8-3. 
transformations of the analysis would appear as follows: 


the tall boy loves the short girl 


tall boy loves the short girl 
2 boy loves the short girl 
1 loves the short girl 
5 loves the short girl 
loves the short girl 
loves the short girl 
loves the short girl 


replace the entire sentence. 


Step number 


9 
10 
11 
12 
13 
14 
15 
16 


The student should trace each step in conjunction with Table 8-2 


(object phrase) 


| 


Fra. 8-3. Successive stages in the translation of an English sentence into Korean, for our illustration 
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girl 
lines are shown the successive changes in the syntax source-language list and below each is shown the Korean translation 


numbers on the brackets indicate the order in which the steps are made. 
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The Translator. With the experience gained in our English-to-Korean 
example we can now proceed to describe the translator itself in detail. 
The translator simply manipulates a list and its sublists. The list is the 
source (-language) code list, which initially is the sentence (or code) in 
the source language (English); the sublists, called the target (-language) 
fragments, are the translated fragments in the target language (Korean) 
which are generated as the syntax analysis takes place. Each time a 
larger syntax entity (i.e., one of higher level number) is found, the larger 
entity replaces the components of which it is made in the source code list and 
the associated translated parts combine to form larger target fragments. 
Finally, the source code list becomes a single element—namely, the 
“sentence’—and the associated translated target fragments merge to 
form the list that is the desired translation. Of course, the level num- 
bers themselves can be used in the source code list instead of their corre- 
sponding syntactical entities, as illustrated above. 

Before proceeding further, let us more clearly define terms that describe 
the parts of a syntax definition. Each definition is a line, and the line 
number is taken as the level number of the corresponding syntacti¢ 
entity (on the left of the ::=) of that line. The possible forms listed 
the right of the syntax definition (to the right of the ::=) are calle 
alternatives; the entities that make up an alternative are called components, 
Thus in the line 


/definite\ 


\artinle] (n-phrase)|(adjective)(n-phrase) 


(n-phrase) :: = (noun) 


there are three alternatives, namely, 


/definite\ 


\ article / (adjective)(n-phrase) 


(noun) (n-phrase) and 


and the two components of the rightmost alternative are 


(adjective) and (n-phrase) 


The components can be expressed directly as their respective level nu 
bers. The source code being translated is said to be composed of a I 
of symbols (words), where a symbol can be a literal or a level numb 
The level number of a literal is assumed to be zero. We shall denote 
key symbol that symbol of the source code under consideration at & 
stage of the analysis process. 

Although the description of the translator appears lengthy, it car 
out a relatively simple recursive processing of the syntax and symb 
using only the operations of the comparison of a symbol and a compon 
and the selection of an adjacent symbol or component for proces! 
Figure 8-4 gives the rough flow chart of the translator, Box | gives 
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Start with 
leftmost symbol (called initial key symbol) 
first line 
rightmost alternative 

rightmost component 





















Search 
for 
key 
(matching) 
component 


Compare, from right to left, the level number 
(or literal) of each successive component 
with the level number (or literal) of the key 
symbol; go to the next alternative and next 
line as necessary, until a successful com- 
parison is made; call this component the 
key component. 
























Does this 
syntax (i.e., 
alternative 
containing the 
key component) 


Within the alternative of this key component: 
compare the corresponding component and 

symbol level numbers, first successively to 
the left and then successively to the right of 
the key component and key symbol, until the 
one is > or < the other, or else all are =. 

























If a component level 
number < the corres- 
ponding symbol level 
number, then go on to 
next alternative, 
searching for same 
key symbol. 


Replace 
symbols by 
equivalent 
syntax 
designation ; 
restart 
search on 
same line 
for this 
syntax 
designation. 


If all the components 
of the alternative 
correspond to the col- 
lection of symbols 
around the key symbol, 
combine the translated 
parts to form a larger 
fragment. Then the 
collection of symbols 

is replaced by the line 
number of the alterna- 
tive as the new level 
number symbol; this 
new symbol becomes the 
new key symbol, and 
the search restarts on 
this same line. 

































continue 
search. 




















No fit 
but sym- 

bol is develop- 
able; restart 
search for it on 
line of ita level 
number, 


If a component level number > the 
corresponding symbol level number, 
then shift to this corresponding 

symbol as the new key symbol; re- 
start the search on the line with 
the same number as the level 

number of this new key symbol. 








Fra, 84, Grows flow chart of the translator, 
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starting point: the initial key symbol is the leftmost symbol of the original 
source code to be translated, and the first component under consideration 
is the rightmost component of the rightmost alternative of the first syn- 
tax line. Box 2 describes the search through the successive components 
for one that will match the key symbol, the search proceeding from right 
to left from the top line down. (The component with which the match is 
made is then called the key component.) 

Recall from our above examples that a syntactical replacement can be 
made only when all the components of an alternative completely match, 
or “fit,” a set of adjacent symbols of the source code. Box 3 system- 
atically performs the necessary comparisons, first to the left and then to 
the right of the key symbol and of the key components. In our above 
example, comparisons around the key component never occurred to the 
left and right at the same time; but in general both directions can be 
required when there are three or more components in an alternative (see 
the examples in the next section). For instance, suppose that the key 
component (kes) were in the alternative 


- + + |\(beae){kes)(xe){ye)] + °° 


and that the key symbol (which must be equal to the key component) 
were in the source code list 


- + + (bs)(as)(kes)(xs){ys) * °° 


Then the comparisons of components and symbols would be made in th 
following order: (ac) with (as); (be) with (bs); (xe) with (xs); and (y@ 
with (ys). 
oe only the level numbers need be compared, and three Cason 
may result. First, suppose every component (level number) in the alte 
native of the key component is equal to its corresponding symbol 
number) in the set about the key symbol in the source code ; then a hig’ 
level number, namely, the number of the line of the alternative, repla 
the set of corresponding source code symbols. The previous translati 
fragments associated with all these symbols are combined, and the sea 
of box 2 is restarted at this same line (as in our example above, where 
and 5 were replaced by 5, and we used line 5 to replace 3 and 5 by 5, an 
so forth); this is accomplished in box 4. Second, suppose that duri 
the comparisons of box 3 a component level number is less than the ¢ 
responding symbol level number; then this symbol certainly could ne 
be developed so as to meet such specifications, since symbol level nu 
bers can only be raised. Hence no fit can be made in that alternatl 
(see box 6), and the search continues as in box 2. Third, suppose du 
the comparisons of box 3 a component level number is greater than @ 
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responding symbol level number; this indicates that perhaps the symbol 
level number could be developed to match the higher component. level 
number. Hence the procedure of box 5 is followed, and this symbol is 
chosen as the new key symbol; the search of box 2 is restarted on the line 
having the same level number as this new key symbol. 

If the syntax is properly formulated and the sentence, or source code, is 
properly formed, then the process should end with the entire source code 
having been replaced by the highest level number and the translation— 
which is going on with each replacement—having been completed. The 
student is now in a position to try the translator of Fig. 8-4 on the exam- 
ple described above. A precise flow chart of the translator can be made 
as in Fig. 8-5, provided that the following indices are understood: 

Let s index the symbols of the code being translated, from left to right. 

Let S index the lines of the syntax description, from top to bottom. 

Let 7 index the alternatives of a line, from right to left; hence S,i 
designates a particular alternative. 

Let 7 index the components of an alternative, from right to left; hence 
S,i,j designates a particular component. 


8-3. Examples of Translating ALGOL Codes into a Symbolic Machine 
Language 


The Source and Target Languages for the Examples. Consider the trans- 
lations of the simple arithmetic expression and the assignment statement 
of ALGOL, as the source language, into a symbolic machine language, as 
the target language. The target language for the purposes of our exam- 
ples is described in Table 8-3 and is simply a one-address symbolic instruc- 
tion system. We shall assume that a compiler has already been written 
that can assign addresses to a code written in this symbolic machine 


TABLE 8-3. DEFINITIONS OF SYMBOLIC-MACHINE-LANGUAGE INSTRUCTIONS 
to Be Usep as THE TarGET LANGUAGE FOR THE EXAMPLEST 


CLA a Replace (accumulator) by (a) 
STR « Transfer (accumulator) to a 
RTP a Raise (accumulator) to the power of (a); leave 
result in accumulator 
MUL a Multiply (accumulator) by (a); put product in 
accumulator 
DIV a Divide (accumulator) by («); put quotient in 
accumulator 
ADD a Add (a) to (accumulator); put sum in accum- 
ulator 
SUB a 


Subtract (a) from (accumulator); put difference 
in accumulator 


| Addresses P, M, and Q are each push-down lists which push down whenever an 


STR instruction is used and pop up on the use of the address by any other instruction. 
































———————— 


TaBLE 8-4. DEFINITION OF SrmpLE ARITHMETI 


c EXPRESSION AND ASSIGNMENT 


SraremMENT or ALGOL{ 


(a) Actual terms 




















A|plc/DizlE\simUIlE| 
1 var) = 4) )clplElFIG AJL K 
2 (eq) 2: = lenny) 
3 (p) i= ou 
en ee 
4 (mult op) :: = MUL | DIV 
) 
5 (close) :: = Gaus 
spit | = 
6 (add op) :: = ADD | SUB 
( 
7 (open) :: = not 
. __ éyar)(eq) 
8 (ft pt) 2: = “TCs 
(var) (open)(sae)(close) 
240" SAIC) {Ca} 
at) | GDM) 
FP GT) Tn 
STR P 
{Cs} 
RTP P 
(f) | (mnf)(mult_op)(f) 
11 (mf) ::= Tei a 
STRM 
{Ca} 
{Co] M 
(mf) | (sae)(add op)(mf) 
12 (sae) :: = 1G Cy 
STRQ 
{C3} 
{C.]Q 
‘ ft pt){sae) 
/ assignment\ mes dl 
18 \ statement / {Ci} 


STR [C2} 


+ To be used as the source language of our sll in ieee 

i ve have place e se , . 

oa aracaier ie ee Ne We in hae added [C,,} and {C,] to our {Cy} no 
‘ther side indicates that the last instruction of the targ 


respective syntactical alternative. 
tion: the square bracket on el 
language fragment is incomp 


(Ci) CLA A 
Thus, if {C»] is eee and [C,| is Q, then ict becomes ADD Q 


10 :: 


11:: 


12 :: 


13 :: 


(b) Level numbers 
A|B|C|D|E | G al 
=F c GH] J 
(empty) 
T 


K|L 


H 


F 


Re 
D 

















“+ (empty) 





"+= empty) 


+ _ 


= ADD | SUB 





1 | 7,12,5 


= CLA [Gil | {Cs} 


9 |10,3,9 
{Ci} | {Ci} 
STR P 
{Cs} 
RTP P 
10 | 11,4,10 
{Ci} | (C1) 
STRM 
{Cs} 
{C.]M 
41 | 12,6,11 
{Cr} | {Ci} 
STRQ 
{C3} 
{C.] Q 

8,12 
{Ci} 

STR [C2} 





the target langu 
directly under 


lete on that side and is to be completed at a later #t 
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language. (See Sec. 2-6 for a discussion of symbolic coding and Sec. 
5-7 for a discussion of an address-assigner program.) Hence our problem 
is to translate a code from the source language ALGOL into the target, 
symbolic machine language; this accomplished, a translated code would 
then have specific addresses assigned and be run on the computer. 

The source language for the purposes of our examples is defined in 
Table 8-4, column a, where we have given with slight simplifications the 
syntax of the ALGOL simple arithmetic expression and assignment 
statement (see Secs. 6-3 and 6-4). In this table we have also formulated 
the semantics of these ALGOL definitions in terms of our illustration 
symbolic-machine target language. In column a we have placed the 
semantic descriptions or the target-language translated fragments under 
the corresponding syntactical alternative (rather than to the right of 
un => as in the above Korean translation example). In column b we 
repeat the same source language definition using level numbers. We 
ure now ready for the translation of our first ALGOL code. 

Examples of Translation. Consider the code 


F:=A+BxC 


Applying our translator of Fig. 8-4 (or Fig. 8-5), we find the work shown 
in Fig. 8-Ga, where the numbers on the brackets indicate the order of the 
steps. Note from column a of Table 8-4 that first F is identified by line 
| asa (var). Then line 8 requires that the next symbol := be examined 
(i.e., become the key symbol); this is identified by line 2 as (eq), from 
which, again by line 8, we obtain the (Ift pt) of bracket 3 (see Fig. 8-6a). 
sy line 13 (of Table 8-4, column a), the next symbol A becomes the key 
symbol, which is identified as a (var) by line 1; by line 8, we go on to the 
next symbol +, which is identified by line 6 as an (add op). By line 12, 
(he previous symbol (namely, (var)) becomes the key symbol. Although 
(var) appears in line 8, no match is made because the level number of 
(eq) is less than the level number of the symbol to the right of the key 
symbol, namely, (add op), and thus (add op) cannot be developed into 
(eq). Hence (var) becomes (t) by line 9. Looking at line 10, component 
2 of alternative 1 is (p) with level number 3, which is less than the level 
number of the symbol preceding the key symbol (namely, (Ift pt)); hence 
we go on to alternative 2, whence (t) becomes (f). Line 10, alternative 1, 
requires (p) to the right of (f); but since the level number of (p) is less 
(han the level number of the symbol to the right of the key symbol, 
namely, (add op), we go on to line 11, On line 11 we find, analogously 
(0 the reasoning just followed, that our key symbol (f) becomes (mf); 
und on line 12, that (mf) becomes (sae) (see brackets 7, 8, and 9 of Fig. 
S-6a), However, again on line 12 of the table, we see that our key 
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symbol, which is now (sae), actually has on its right an (add op) as 


a required by alternative 1, component 2; hence we choose the next symbol 

B Blu o# ie) =< z - B as the new key symbol and see whether or not it can be developed into 

a 8 < a S Q a 5 & an (mf). It becomes first a (t), then an (f), and eventually, by line 11, 

9 S|URn0< Boz G an (mf) (see brackets 10, 11, 12, 13, and 14 of Fig. 8-6a). However, on 
a7 


line 11, alternative 1 can now be used, since (mult op) has already been 








O_<z formed (in bracket 11). Thus we choose C as the key symbol, which 
3 <° < oz <2 develops into an (mf) by absorbing the previous two symbols: i.e., by 
z dE AE dS line 11, alternative 1, 


| 


(mf) +: = (mf)(mult op)<f) 


U oO a e) > < 2 
a¢57<¢8 <3 (see bracket 18). Finally, by line 12 of the table, we combine this (mf) 
B52509 £ as 
“dkd< & oD 2 


with the preceding (add op) and (sae), forming an (sae) in bracket 19; 
then by line 13 the assignment statement is formed in bracket 20. Note 
that we could have accomplished the entire process by utilizing the level 
numbers alone (see column 6 of Table 8-4 and the work in Fig. 8-6b). 
The translation of parentheses is illustrated by 


| 


(t) 
LAC 
STRQ 
CLA B 
ADD Q 





Cc 


| 


Fic. 8-7. Work for translating F =AX(B+C). 


F:=Ax(B+C) 


(sae) 
CLA C 
STRQ 
CLA B 
ADD Q 


the work for which is shown in Fig. 8-7. Note that the expression 
hetween the parentheses becomes an (sae) before the parentheses are 
absorbed in the formation of the (t). 


As another example consider the translation of 


(sae) 
CLA B 


F:=ATB+CxDxE 


“ 


the work for which appears in Fig. 8-8. The student should carefully 
check the successive steps, identifying for each the syntactical line used 
from Table 8-4 and noting the sections of the translator flow chart 
(lig. 8-4 or 8-5) utilized. The student should also check the final code 
to see that it actually does compute the desired arithmetic expression. 
Meveral observations can be made about this code. First, note that the 
address M must be a push-down list, for first the code stores (E) in M, 
and then (D) is stored in M. Obviously (E) and (D) must be stored in 
different cells; however, since (D) is stored last, it is utilized first for 
multiplication, allowing (E) to pop up for the next multiplication (see 
footnote in Table 8-3). The second observation concerns the fact that 
(he code resulting from the translation is far from minimum; it is not har¢ 
(ov code this function using fewer instructions. That this is true is seen, 
for example, in the first two instructions: first (E) is put into the accumu- 
lator, and then in the very next instruction it is re-stored. In order to 
eliminate such unnecessary instructions, we must reformulate the syntax 
and semantic definitions in more detail, 





(mf) 
CLA A 
(mf) 
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Example of a More Efficient Syntax and Semantic Description. In 
Table 8-5, we present a collection of syntax and semantic Cescriptions 
that will produce a near-minimum code. For example, in Fig. 8-9 we 
have shown the work for translating the same assignment statement as in 
Fig. 8-8, namely F:=ATB+CxDxE; now only 8 instead of 14 instrue- 
tions are required. In Fig. 8-10 we have illustrated the work for 
translating 


A:=(BxC)T (D+E)/F—G)+HTJ—K)+L 


The reader will benefit greatly by first observing the final code and how it 
builds up the desired function, and then studying the work in Fig. 8-1 
and how the translator builds up this final code. Although we have 
written the syntax in terms of syntactical entities for ease of study, w 
have given the example of Fig. 8-10 in terms of level numbers for ease i 
tracing. In building up the translated code in this near-minimu 
fashion, the translator almost appears to be “intelligent.’”’ This behavi 
of the translator, however, is clearly entirely dependent on the “efficiency’ 
of the syntactical and semantic description. 


8-4. Deductive Inference: Mathematical Optimization 


Modern Mathematical Decision Theories. Planning, controlling, ada 
ing, self-regulating, optimizing—all these describe processes which a 
attributes of “intelligent” systems, and programming a computer to 
in such processes can be interpreted as imparting a measure of “inte 
ligence” to the computer. In this and the next section we shall be ¢o 
cerned with elementary illustrations of how a computer can be made 
exhibit such attributes through deductive inference. We classify mode 
mathematical decision theories as deductive-inference methodolo 
because these mathematical methods are the outcome of or conclusi 
drawn through logical (mathematical) deduction made from gi 
premises; the methods produce optimum or measured near-optim 
solutions and are consequences of mathematical results. We use t 
term deductive inference, as opposed to inductive inference, to deseri 
intelligent behavior based upon deductive mathematical optimizing 
regulating methods, as opposed to hypothesis making and intellige 
trial and error (which are the subjects of Sec. 8-6). 

To illustrate these points, we shall consider in this section selee 
examples of modern mathematical decision theories, namely, li 
programming, game theory, and dynamic programming. (The te 
‘‘programming”’ used in the names of these mathematical methods d 
not mean computer programming; it is intended rather to imply “Dp 
ning” or “problem analysis.”) In the next section we shall show ho 
typically “intelligent” process, that of making a medical diagnosis 
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treatment plan, can be formulated in terms of an integrated sequence of 
such mathematical decision-making methods—i.e., in terms of deductive 
inference. 

The reader may wonder why we consider what are primarily mathe- 
matical methods in this chapter on programming to achieve artificial 
intelligence. The reason is, of course, that whatever measure of intel- 
ligence is achieved by a computer arises from mathematical procedures 
carried out by the programs and routines inside the computer. Actually 
the role of the computer is twofold: first, the fact that a computer is és 
be utilized forces the precise and unambiguous analysis of the approach— 
covering all alternatives and eventualities—so that a “‘programmable”’ 
process results; and second, the availability of the computer makes 
feasible approaches to problems that otherwise would never have been 
it tempted. 

Linear Programming: Decision under Certainty. Decision processes 
nure characteristically associated with three types of ingredients: the 
list of possible states of the system, the list of alternative decisions that 
oun be made affecting the system, and the collection of values which 
ix associated with each decision-state pair. When the state of the system 
ix known, we say that the problem is one of decision under certainty; 
when only the probability is known of the system being in each of the 
possible states, we call the problem one of decision under risk; and when 
no knowledge is available about which state the system is in, thi problem 
is known as one of decision under wncertainty.t Consider first the sim- 
plest example of a problem of decision under certainty. Suppose that a 

patient has disease d; and that there are available two alternative treat- 
ments, T, and Ts, with prognosis related values Vi; and V2;. The situa- 
(ion can be represented by the following array: 


d; System state 


Ti Vii 


Alternative decisions 
T2 Vo; 


Decision-state values 


‘Trivially, whichever treatment corresponds to the greatest value is 
chosen. 

A more complicated decision problem under certainty can involve 
linear programming. For example, suppose that there were two possible 
(reatments for a cancer patient, drug administration and radiation 
(herapy, and that, in terms of some common units of effectiveness (e.g 
hy bioassay), the drug contains 1,000 curative units/oz and the radiation 


| See, for example, R, D, Luce and H, Raiffa, “Games and Decisions,” John Wiley 


& Sons, Inc,, New York, 1957, 
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gives 1,000 curative units/min. Suppose the patient must recetve at 
least 3,000 curative units of treatment. However, both treatments are 
toxic to the patient so that, again in some common units, the drug 
contains 400 toxic units/oz, while the radiation produces 1,000 toxie 
units/min; suppose the patient must not receive more than 2,000 toxi¢ 
units. Finally, in terms of discomfort experienced by the patient, sup= 
pose that the drug was three times as noxious per ounce as the radiatio 
per minute. The problem is to determine the doses that will satisf 
the curative and toxic criteria of the treatment but will minimize the di 
comfort to the patient. If X1 is the number of ounces of the drug to 
prescribed and X, is the number of minutes of radiation to be received, 
then to cure the patient we must have 


1,000X, + 1,000X, > 3,000 
but because of the toxic effects, 


400X, + 1,000X, < 2,000 


The problem is to determine X, and X» that satisfy these inequalities a 
minimize the “discomfort”’ 


D = 3X,4+ X2 


Since in addition X; > 0 and X, > 0, the solution lies in the crosshate 
area of Fig. 8-11. However, this last equation is the equation of a pla 
the points of which give the value of the discomfort D for every pair | 
values Xi, Xo. Clearly the minimum value of D (consistent with 
restrictions on X; and Xz) must lie on one of the vertices of the triangu 
area on the D plane whose projection is the crosshatched area on the Xy 
plane. Hence check the value of 3X1 + X, at each of the three poi 
(3,0), (5,0), and (54,44), finding D to be, respectively, 9, 15, and 6 
Thus combined treatment is indicated at 54 oz of drug and 44 min 
radiation. The problem we have been solving is of course manife 
unrealistic, but only because of the oversimplification inherent in 
such illustrative example. , 

Other, more realistic problems involve a multitude of factors in m 
inequalities. Such problems may involve optimization (or minim 
tion) of factors associated with the organization of industrial econo 
planning optimum military-defense strategies, or choosing a redu 
diet. As we have just illustrated, the linear-programming prob 
involved is the optimization of a linear function subject to linear 
straints. In general, realistic problems are so large that the use of 
puters is required to make their solution feasible. 

Game Theory: Decision under Uncertainty. If there are several po 
states in which the system may be, say dy, da, . . « » dn with reape 
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Fia. 8-11. Solution to the drug-radiation linear-programming problem. 


probabilities Pi, Ps, ..., Pn; if the alternative decisions are 7, T:, 
... , Tmjand if V;; is the value of decision 7; when the system is in state 
(d;, then we can make the following array: 


d, ds we oe dg 
P; Po +++ Pa 
T, Vi Vie +++ Vin 
T2 Va Voe Von 
Tm Vint Vine ieee Vinn 


The choice of a decision that will maximize the resulting ‘expected” 
vilue is found by computing 

E; = > ViyP; (8-1) 

j=l 

for each decision 7; and choosing that decision with the greatest H;. For 
instance, consider 7,000 patients for all of whom two tentative possible 
diagnoses (d; or dy) have been made with probabilities 57 and 27, respec- 
lively. Suppose that there exists a treatment 7, which is 90 per cent 
offective against disease d, and only 380 per cent effective against d», 
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that is, Vir = 0.9 and Viz. = 0.3, and a treatment T2 which is 10 per cent 
effective against d; but 100 per cent effective against ds, that is, Vor = 0.1 
and Vox = 1.0 (see Fig. 8-12a). Under which treatment would we expect 
to cure the most patients? Simple substitution in Eq. (8-1) shows that 


Ey = 51 0 and Ky = 2546 


aK 


and hence we would expect to cure more patients with treatment T. 
However, for any particular patient we do not know which of the tw 
diseases he has, and therefore the situation is one of risk for that patient 
even though the treatment of choice will maximize the probability 


curing each patient. For two diseases, in general, if we let P be th 


a 
Co 





Values 


(a) 


Expected value E of the 
proportion cured 
ou 
w 
Expected value E 





ie} 5 45/7 1 0 35.6 1 
Probability P Fraction (or probability) Q 
(0) (c) 


Fig. 8-12. Illustration of solution to problem with values as in (a), under (b) risk 
(c) uncertainty. 


probability that a patient has di, then 1 — P will be the probability t 
he has ds, and using the same values of our example, we can write 
expected values from Eq. (8-1) as 


E, = 0.9P + 0.3(1 — P) 
E, = 0.1P + (1 — P) 


Figure 8-126 illustrates the graphs of these two equations (with 
point for the given P = 54 indicated). Note that 71 is the treatment 
choice for P to the right of where the lines cross and 7’; is the treatment 
choice for P to the left of the crossing. 

Suppose next that the probabilities ?; were not known, i.e., that th 
were no information available concerning the probabilities; this is the 
of uncertainty, and the process of choosing the optimum decision is cal 
game theory. In the terminology of game theory we can renee the « 
example of treatment choice as follows: There are. two players, 
physician and nature. The physician is trying to determine the 
strategy, i.e., choice of treatment (the decision), from his limited kno 
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cdge of nature. The matrix representation of values, given in, say, Fig. 
8-12a, constitutes the payoffs—what the physician will “win” and nature 
“lose.” But this time we do not know the probabilities that the 7,060 
patients have d; or dz. Our problem is to choose that treatment which 
will ensure that we cure the largest number of people—that is, to maximize 
the minimum possible number of patients that we expect can be cured. 
There are actually three ways we can choose the treatment: (1) treat all 
patients with 7), (2) treat all patients with 72, and (3) treat some patients 
with 7; and others with T.. The first two ways are called ‘“‘oure strate- 
nics,” the third, a “mixed strategy.” 

Consider the values of Fig. 8-12a, and suppose we choose the third 
method of treatment (which really includes the first two anyway). Let 
() be the fraction of patients to be treated with 7,; then 1 — Q is the 
fraction to be treated with T.. Observe that, if all the patients had dy, 
we would expect to cure a proportion of patients 


E(d;) = 0.99 + 0.10. — Q) 


We have graphed this equation in Fig. 8-12c. Similarly, if all the patients 
had d», we would expect to cure a proportion of patients 


E(d2) = 0.3Q + 1.001 — Q) 


We have also graphed this equation in Fig. 8-12c. If some of the patients 
had d; and the rest ds, then the expected value would fall inside the 
hatched area for any particular value of Q. Evidently the lower (thick- 
ened) line in Fig. 8-12¢ represents the minimum proportion of patients 
that we can certainly expect to cure not knowing how many have d, 
or ds, For Q = 0.6, this minimum number is a maximum, and we 
would expect to cure 58 per cent of the patients. Hence 60 per cent of 
the patients should be treated with 7',, and the remaining 40 per cent 
with 7. 

To arrange for such a treatment is easy: Separate the patients at ran- 
(lom into two groups, one containing 60 per cent of the patients, the other 
containing 40 per cent; the former group will be given 7, the latter 7». 
Ilowever, there is another way of arranging for such a treatment, by 
‘hoosing a random number as each patient comes up for treatment. In 
fuct, for the physician with a single patient, the method of choosing a 
rindom number must be used, for the patient cannot be divided up 60-40 
lor treatment! In this manner the concept of mixed strategy can be 
upplied to a single case. 

Such a method for choosing the treatment may be very hard to appreci- 
mle at first contact, but this is just the method used every day when 
probabilities are applied to single situations. Of course, in actual practice 


some further information bearing on the choice of treatment would be 














































































330 [Cuap. 8 


sought—i.e., the formulation of the problem of which treatment to give 
the patient is far more complicated than that posed by the single prob- 
lem discussed above. In conclusion we quote J. D. Williamston the role 
of game theory: 


AUTOMATIC-PROGRAMMING LANGUAGES 


While there are specific applications today, despite the current limitations of 
the theory, perhaps its greatest contribution so far has been an intangible one? 
the general orientation given to people who are faced with overcomplex problems, 
Even though these problems are not strictly solvable—it helps to have a frame= 
work in which to work on them. The concepts of a strategy, the representation 
of the payoffs, the concepts of pure and mixed strategies, and so on, give valua’ 
orientation to persons who must think about complicated situations. 


Dynamic Programming. Consider next the problem of making 
sequence of decisions so as to maximize some value after, say, 2 su 
decisions. A commonly used example for introducing the conce 
involved in this type of problem is that of the toymaker’s dilem 
Suppose a toymaker can find himself in one of four states: manufactur 
a toy that (1) is a best seller, (2) is a good seller, (3) is a poor seller, a 
(4) does not sell at all. Let us denote these states by Hy, Ho, H;, and 
Now suppose that if the toymaker is in any state, say H;, then there 
transition probability p,; that in the next month he will be in state Hj, 
pi that he will remain in state H;. Associated with each such transit 
there is a value, or profit, that can be expected, say v;; for the value of 
transition from state H; to state H;, or vx for remaining in state 
However, notwithstanding the fickleness of the buying public, the 
maker does have some control over the situation: suppose, for insta 
that for each state he can make a decision between two alternatl 
courses of action that can change the transition probabilities and as 
ated values. For example, if he has a best-seller toy, he could adve 
to keep the toy in the public favor, or he could just rest on his lau 
the profit would be less in the former case than the latter owing to 
advertising costs, but the probability of staying in the best-selling #' 
would be greater in the former than the latter case. If he has a poor 
he could invest in research to develop a better toy or he could just h 
that the toy fashions will change next month: in the former case 
probability for a transition to a better state would be higher, but 
might have to use up reserves for funding the research; in the latter 
he would not be increasing the probability for a transition to a bet 
state, but he would not have to dig into his reserves. 

Let T: and T; be the two courses of action that can be taken in 
state; let p%, and vo}; be the transition probability and the associated ¥ 


J. D. Williams, “The Compleat Strategyst,” p. 217, McGraw-Hill Book Com 
Inc., New York, 1954. 
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May June July 
A @ 
H, 
Ms ® 
Hh @ 


l’iq. 8-13. Hlustration of transitions among states: the light lines represent decision 7), 
the heavy lines decision 72; transitions are shown fo state Hin June from all possible 
states in May and from state H» in June to all possible states in July. 


Tape 8-6. ARRAY OF TRANSITION PROBABILITIES AND VALUES, WITH 
ILLUSTRATIVE MAGNITUDES 









































(a) (b) 
State Decision ae Illustrative values 
P . : 
Robebibnee ene siied used in the example in the text 
1 1 1 1 
7, Pir Pia Pis Pras 0.5 0.3 0.1 0.1 
H, vi, vt, vty vty 500 300 100 —200 
best seller 
Ts Pi: Piz Pis Pisa 0.2 0.4 O38 0.1 
vi, Vln Mis le 700 500 300 —100 
1 1 1 1 2 0 1 
Tr, Po: Poe Pos Poa 0.4 0.3 0. . 
Hy Ud 22 Uag Ub 300 §=200 50. — 200 
wood seller 
Ts Pr: Piz Pas Pas 0.2.) OB. Ot 04 
v2, vig v3, Rg 600 400 100 —200 
31 Ps2 Ps PS 0.3 O38 0.2 
Ty P31 P32 P33 Pas 0.2 i 5 7 
H U1 Vie M33 M54 300 100 0 —300 
8 
or seller 
ie i ph Ph Ph Pe | 0.1 0.1 0.5 0.3 
; vt, vk, UR, UR 600 400 100 —200 
wi 1 1 1 1 ¢ 
T Par Pao Pas Pas 0.1 0.2 0.4 0.3 
; vis ule. vby. ott 300 0 —100 —490 
Wy LAs M4 one ie = 
no'saise ph, ple Ph Ph 01 O1 O02 0.6 
: vi, of, vl vy 500 300 0 —200 
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of going from state 7 to state j for decision k on the alternative courses 
of action. Then, for instance, (pj2,0i2), (P22,V32), (P32,%32), and (p42,049) 
represent the possible transitions to state H» under decision 7’; from the 
previous month, and (31,021), (P2,222), (P2333), and (p34,024) represent 
the possible transitions from state H» under decision 7’; for the following 
month (see Fig. 8-13). The situation is summarized in Table 8-6, 
column a. 

We are now ready to state the toymaker’s problem: How can he 
plan a sequence of decisions to maximize his expected profit (value) in 
n months? The solution to this problem will take the form of an array 
giving the best decision for each possible state in which the toymaker 
may find himself and for every month. For instance, Fig. 8-14 shows @ 
6-month solution to our problem for the transition probabilities an 
values given in Table 8-6, column b. Thus, in order to maximize 
chances of obtaining the largest total profit by the end of Septembe 
the toymaker need merely observe his state and the month and re 
from Fig. 8-14 the best decision to make. 















April May June July August Septem 
H, T, T, T, Ti T. T. 
Hp T, T Ts T. T. T. 
H; 7, Ty Ti T, T 1 Ts 


Fra. 8-14. Solution to toymaker’s problem for transition probabilities and valu 
shown in Table 8-6, column b. 


So far we have stated the problem and indicated the form of t 
solution. We now show how the solution is obtained. First, note th 
if the toymaker were in state H; and were interested only in maximiai 
his immediate expected profit, i.e., for one particular month, then t 
decision would be made by calculating 


= Lid? ot 11 11 iit 

Ej} = > viet = Didi + Pidic + Didis + Piia 
J 

2,2 _ ,2,2 2,2 2,2 2,2 

and E? = > vie = Pili + Diig + Disis + Disia 
j 


and choosing decision 7, if E;! were the larger and 7, if H;? were t 
larger. However, in the long run this might not optimize the total pro 
For example, if the toymaker were in the poor state, and it were 
early in the game, then in the long run it might pay to invest in resem 
to develop a better toy and lose money for a month or two with hi 
prospects of having a best seller later on, rather than to do nothing 
no increasing prospects of having a better toy, 
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We solve this problem by means of Richard Bellman’s principle of 
optimality, which follows: Let V;(n) be the total expected value obtained 
in n months, starting from H,, if an optimum sequence of decisions is chosen 
at each month. Then the principle of optimality states that 


Vi(n — 1) = max Y phfos + Vi(n)] 
pe 


max [ BH + ¥ vivicn) | 
3: 


Thus if an optimum sequence of decisions has already been chosen for 
the final n months, then the principle describes how to choose the opti- 
mum decision for the previous month. The technique for performing the 
calculations is as follows: First, choose the boundary, or “end,” con- 
ditions V;(N), if N months in all are to be planned for. Next, for each 
possible state z and alternative decision k, compute 


Y piles + Vi(N)] = BF + Y piVi(N) 
j j 


and for each 7 note for which k a maximum is obtained; call this 
V(N — 1). Then compute 


Y piles + VW — Ly] = Bt + Y piv — 1) 
j j 


and again for each 7 note for which k a maximum is obtained, calling this 
V.(N — 2); and so forth. Of course, the k indicates each time which 
decision is to be made. In this way the sequence of decisions of Fig. 8-14 
was computed, for boundary conditions 


Vi(6) = V.(6) = V3(6) = V4(6) = 0 


The iterative technique utilized to obtain the solution of the dynamic- 
programming problem just described is typical of a large class of prob- 
lems involving over-all rather than local optimization, control analysis, 
and planning in the large range rather than in the neighborhood. We 
quote Richard Bellman :T 


The last years have seen an extraordinary acceleration of interest in the analy- 
xis and control of systems with a particular focussing of attention on self-regulat- 
ing systems. We must face the fact that we have landed with both feet in the 
era of automation. . . . We must then develop new techniques in order to obtain 
solutions to mathematical problems posed by control processes. Fortunately, to 
help us in our efforts, we possess a new tool, the digital computer, whose existence 
changes the very concept of the term “solution.” 


{ R. Bellman, “Adaptive Control Processes,” pp, 3, 7, Princeton University Press, 
Princeton, NvJ,, 1961, 
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8-5. Deductive Inference : Programming for Medical Diagnosist 


Reasoning in Medical Diagnosis. The purpose of this section is to 
illustrate the analysis of a complicated reasoning process—namely, the 
reasoning inherent in a medical diagnosis—in terms of the methods of 
deductive inference. The reasoning foundations of medical diagnosis 
have received recent emphasis as the result of the increasing interest in 
the use of computers to aid the diagnostic processes. Before computers 
can be used effectively for such purposes, however, we need to know more 
about how the physician makes a medical diagnosis. 

If a physician is asked, “How do you make a medical diagnosis?” 
his explanation of the process might be as follows: ‘First, I obtain the 
case facts from the patient’s history, physical examination, and labora 
tory tests. Second, I evaluate the relative importance of the differen 
signs and symptoms. Some of the data may be of first-order importan ’ 
and other data of less importance. Third, to make a differential dia 
nosis, I list all the diseases which the specific case can reasonably resemble 
Then I exclude one disease after another from the list until it becom 
apparent that the case can be fitted into a definite disease category, 
that it may be one of several possible diseases, or else that its exact natu 
cannot be determined.” This, obviously, is a greatly simplified expl 
nation of the process of diagnosis, for the physician might also commen 
that after seeing a patient he often has a “feeling about the case.” ft 
“feeling,” although hard to explain, may be a summation of his imp 
sions concerning the way the data seems to fit together, the patient 
reliability, general appearance, facial expression, and so forth; and t 
physician might add that such thoughts do influence the considered di 
nosis. No one can doubt that complex reasoning processes are involv 
in making a medical diagnosis. The diagnosis is important because 
helps the physician to choose an optimum therapy, a decision which 
itself demands another complex reasoning process. 

Our goal is the interpretation of medical diagnosis in terms of a mat 
matical formulation that accurately reflects the characteristics of medi 
diagnostic processes. Such a formulation of medical diagnosis employa 
sequential decision technique, using logical analysis to determine possi 
alternative diagnoses, probabilistic analysis to determine the probabilit 
of these alternatives, and then value theory and dynamic programmi 


+ This section is based on the following articles by R. 8. Ledley and L. B. Lua 
Reasoning Foundations of Medical Diagnosis, Science, vol. 130, no. Batt pp. 
July 3, 1959; The Use of Electronic Computers in Medical Data Processing, 
Trans. on Med. Electronics, vol. ME-7, pp. 31-47, January, 1960; and Medical 
nosis and Modern Decision Making, Proc. Appl. Math, Symposium, Mareh, t 
American Mathematical Society, Providence, R.I1., 1962. 
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to assist in the choice of treatment plan. In practice, the physician first 
utilizes an iterative procedure of logical analysis and diagnostic testing, 
employing in each cycle tests of increasingly specific nature for further 
narrowing the alternative possible diagnoses. When alternatives still 
remain, Bayes’ formula presents important quantitative insight into the 
relative roles played in the diagnostic decision by both the disease- 
symptom relationships of medical knowledge and the circumstantial fac- 
tors involved in the location, the season, epidemics, etc. Almost surely 
the primary purpose for making a diagnosis is to guide the processes of 
formulating the treatment plan. In many cases the treatment plan must 
he formulated in the absence of a precisely defined diagnosis, and a 
diagnostic-analysis treatment-formulation iterative cycle, in the dynamic- 
programming sense, can be utilized. If such procedures are to be applied 
in practical situations, the use of high-speed electronic computers is 
ulmost certainly indicated; but even in using such computers, care must 
he taken that the computational techniques employed can handle the 
data manipulation within a feasible time period. 

Listing the Diagnostic Possibilities. There are three ingredients to the 
logical concepts inherent in medical diagnosis; these are (1) medical 
knowledge, (2) the signs and symptoms presented by the patient, (3) 
the final medical diagnosis itself. Medical knowledge presents certain 
information about relationships that exist between the symptoms and 
the diseases. The symptoms} present further information associated 
with this particular patient. With these two sources of available infor- 
mation, and by means of the propositional calculus of symbolic logic, 
(he diagnosis is made. Consider n symptoms, denoted by 


S(1), SQ), . . . , S(n) 
ind m diseases, denoted by 
D(1), D2), . . . , Dim) 


The notation to be used here for negation (not), logical product (and), 
logical sum (inclusive or), and implies (zf, then) is that used in Chap. 10. 
A listing can be made of all the 2*+™ combinations of symptoms and 
diseases a patient can and cannot have. For example, the columns of 
lig. 8-15 represent such a listing for three symptoms and three diseases, 
with a unit signifying that the patient has the symptom or disease of the 
vorresponding row, a zero that he does not. The column with the arrow 
represents a patient having symptoms S(1), having S(2), not having 


(3), not having D(1), having D(2), and not having D(3). Each such 


| For the purposes of this section, the terms symptoms, signs, and tests are considered 
synonymous and include medical history, laboratory-test results, ete. 


The physician 
inust determine whether a symptom exists or not, 








7 0123 45 67 0123 4567 0123 4567 


7 0123 456 


Symptom complexes s* 


y 


0123 4567 0123 45 67 0 123 4567 0123 456 
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? 
a combination of symptoms and diseases, called a symptom-disease com- 
4 Pp 
| plex (henceforth, sde), will be denoted by C;‘; each combination of symp- 
g toms, a symptom complex (sc), will be denoted by s‘; and each combina- 
3 tion of diseases, a disease complex (dc), by d;. This notation is illustrated 
§ in Fig. 8-15, where 
§ sid; = C; 
o 8 ' ; , 
a The array of units and zeros of Fig. 8-15 is called a logical basis (see Sec. 
3 10-5) for three diseases and three symptoms and is simply a listing of all 
3 conceivable sde’s. 
op We use the phrase “all conceivable sde’s’”’; but not all these are posszble, 
re) % or actually occur. It is medical knowledge that informs us which sdc’s 
4 do or do not occur. For our example, suppose that medical knowledge 
Ey consists of the following four observations: 
° 
WRX 9 Statement Symbolism 
$ 1, A patient having D(1) and also D(1) - [D(2) + D(3)] — SC) - 8(8) 
5 Qs el 3 either D(2) or D(3) must have 
S EB both symptoms S(1) and S(3) 
n J — — 
2 = 2. Ifa patient does not have D(2), D(2) > S(2) 
Z B then he does not have S(2) 
S g 2 3, Ifa patient does not have D(1) _D() - D(2) - D(3) > S@3) 
eee aa Ly but does have both D(2) and 
WEN RAs oo Sf wh 
ADCDRuawW«@ g 4 D(3), then he has symptom 
SES Gs : 4. Apatient has atleast onesymp- —_[S(1) + S(2) + S(3)]- [D(1) + D@) + DBM 
aan ene tom and at least one disease t 
San 8xns 
acs ana a is + In medical practice the presence of a complaint (symptom) must be regarded as 
pe a oe presumptive evidence of disease, but the absence of a complaint cannot be equated 
aOoo one with absence of disease. 


‘The logical product of such statements is the Boolean function H that 
represents the relationships between disease and symptoms that comprise 
medical knowledge; i.e., 


E = E{S(1), . . . ,S(m),D(), . . . ,D(m)] 
lor our example, 


i = {D() - [DQ) + DB)|> SQ) - SB)} 
: (DQ) > SQ)} + {DU - D2) - DB) > S)} 
 {[S(1) + S(2) + 8(3)] (DC) + DQ) + DY} 


Fic. 8-15. Logical basis for the illustrative example, 


knowledge. 


‘The sde columns crossed off in Fig. 8-15 represent sde’s that fail to satisfy 
one or more of these relations. For instance, consider the rightmost 





————————————  —— 
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crossed-off column, namely, C;°, which is 

S(1) 0 

S(2) 1 

S(3) 1 

D() 1 

D(2) 1 

DB) 1 


This sde represents a patient having D(1) and certainly D(2) or D(8), 
but not having S(1), contrary to statement 1, and hence it has been 
crossed off. Another point of view is that the only columns of the basis 
that are to remain are those which each satisfy all the statements of 
medical knowledge. The remaining columns form what is called @ 
reduced (constrained) basis (see Sec. 10-9). 

Consider next a patient to be diagnosed. The patient presents 
profile of certain symptoms that he does and does not have; however, 
general there will remain other symptoms about which no information h 
been obtained from the patient. Such a possibly incomplete sympto 
complex is called the patient’s symptom profile G. Similarly, a possib 
incomplete disease complex relating to a patient we call the patient’ 
disease profile f. The combination of these profiles is the patient 
symptom-disease profile, f-G. Thus the patient’s profile G is a Boole 


function of the symptoms: 
G =G{[S(1), .. - ,S@)] 
Now we are ready to state the logical problem: given the patient 


symptom profile G and medical knowledge FE, we desire to find t 
patient’s disease profile f as a Boolean function of the diseases, 


f =f[DQ), . - - Pom) 
For our example, suppose that the patient does not have symptom S( 
but does have symptom S(2); that is, ; 

G = SC) - S(2) 
The only remaining sde’s that satisfy this condition are C22, C2, 
C5, corresponding to d» and ds. Thus the diagnosis becomes 
f = DQ): D2): D3) + Dd) - D(2) - D3) 
= D(1): D2) 


Hence the patient does not have D(1) and does have D(2), but it is 
known whether or not he has D(3). As another example, let 


@ = S(1) : S(2) 
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Then it is easily found that we ha i i 
ve a diagnosis of four ibiliti 
namely, dz, ds, de, and d;; thus pee 


D(I) - D(2) - DB) + DQ) - D2) - DB) 
+ DU): D2) - D(3) + DQ): D(2): DB) 


P= 


D(2) 


which means that the patient has disease D(2) 
and may or ma t have 
D(1) or D3). As a final example, let seat 


G = S(1) - S22) - 8B) 


ie let S(3) be taken as a further test, and let the result be negative 
ere 


f = Dd) - D2) - DB) 


which is a unique diagnosis which resulted from the additional test. 
| he methods of Chap. 10 can be utilized for determining the diagnostic 
possibilities, where the problem is one of solving the equation 


Gof 


for the function f, with respect to a basis constrained by the function E 

Determining Further Tests. The ease with which medical avngion 
or test results can be obtained varies greatly with the complexity of the 
examination, the cost, the inconvenience to the patient, and so forth 
Mhus, in any particular medical specialty, the Sicapniniytionis could be 
tlussified using difficulty of execution as the criterion. For example, it is 
often a fairly straightforward matter to obtain some aspects of a patient's 
medical history; then some parts of a physical examination are soe 
convenient to perform than others; or some blood tests and X-ray 
examinations are less difficult and costly than others; and finally, certain 
(ests, such as biopsies and exploratory operations, are used only when no 
other technique can provide the information needed. Clearly it is not 
leusible to make all possible tests on a patient before determining the 
diagnosis. The more realistic method uses iterated stages: making a 
kroup of tests, determining whether or not a precise diagnosis can be 
made from the results so far obtained, determining what other tests 
rhould be made, making these tests, etc. (see Fig. 8-16). 

. Chis problem can be formulated in terms of our logical considerations 
ol the previous section, The formulation will be described by means af 
an illustration, Figure 8-17 shows a set of symptom-disease complexes 
associated with 14 symptoms and 8 diseases, For this illustration, the 
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symptoms have been classified into history, physical examination, 
laboratory blood tests, and bone marrow tests, in order of increasin 

difficulty of performance. Notice that S(1) actually contributes nothing 
to the diagnosis: having a unit in every column, it does not distinguish 
between any columns. Suppose that the patient has both S(2) and S(3). 
Then the diagnosis so far is that he has a disease complex associated with 
one of the leftmost 11 columns. Because of the patient’s symptoms, we 
can eliminate from consideration the rest of the columns shown shaded i 
Fig. 8-17: Of the physical-examination symptoms (4), S(5), and S(6), 
only two are necessary, for all three cannot better differentiate among the 





i.e., determine 
whether or not the patient has S(z), . . - , 
S(r) 


Perform further tests; 






Make adjusted logical diagnosis; i.e., 






further eliminate D(j), ..., D(@ | Unique 
from consideration diagnosis 
obtained 


> 





Determine tests to be made next 
No further 


tests can 
be made 











Fic, 8-16. The diagnostic-testing iteration. 


remaining columns than can any two. Suppose that S(4) and S( 
are taken and that the patient has S(4) and not S(5); then only 7 colu 
remain as possible diagnoses, columns 3 through 9, as shown in Fig. 8-1 
The choice among the blood tests S(7) through S(12) is not so easy 
make. It can be shown that only three of these six tests need be mit 
namely, S(9), S(10), and S(12). Suppose that the patient prese 
positive results for tests S(9) and S(12) and a negative result for t 
S(10). Then only a single column remains, column 5, as shown in I 
8-17. Thus the bone marrow tests need not be made; the patient has 
combination of diseases D(4) and D(5) and none of the other disea 
Suppose, on the other hand, that at the physical examination the pati 
had both S(4) and S(5). Then only columns |! and 2 would remain, 
the blood tests would not distinguish between them; bone marrow 
S(13) would have to be made. 
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Calculating the Alternative Diagnostic Probabilities. Frequently it is 
unfeasible or undesirable to make further tests to distinguish among 
alternative possible diagnoses. The problem then resolves into the 
question: In the light of the patient’s present symptom profile G what is 
the probability that the patient has a particular disease complex f— 
that is, what is the conditicnal probability P({|G@), where the term to the 


Column numbers: 1 2 3 4 5 6 7 8 9 101112 13 14 15 16 
P S(1) 114141 1111 11 1 
aor S(2) 1111 1111 11 0 
it | — S(3) 114141 11141 11 0 
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Wid, 8-17, Logical basis for symptom-disease complexes associated with 14 symptoms 
und 8 diseases. After each set of diagnostic tests, the possible diagnoses are studied 
to determine what tests to do next. 


right of the stroke is the condition (symptom profile) and the term to the 
loft is the event (disease complex)? That probabilities can enter into a 
diagnosis depends on the fact that probability can enter into medical 
knowledge. For example, a medical textbook may read that, if the 
patient has disease j, then there is only a certain chance that he will have 
xymptom?. At present it may generally be said that specific probabilities 
are rarely known; medical diagnostic textbooks rarely give numerical 
values. It is, however, a relatively simple matter to collect such sta- 
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i ideri i tially academic 

tistics. Since we are considering topics from an essen 
point of view, we shall assume that the probabilities are known and we 
shall discuss methods of utilizing such probabilities in medical aie 

Consider, for example, a patient having ‘symptom profile S(1) - S( dy 
in the illustration of Fig. 8-15. The logical diagnosis results in two 
alternative diagnoses, d2 and de. If we let 

P(st-d)) = PCS) 
then the probabilities for diagnoses d; and ds become 
anv P(C.?) + P(C2*) 
Plaa|S@) SQ) = peay + PC) + PC 


P(Cs°) 
P{ds|SC) - S@)] = P(C22) + P(C2°) + P(C6°) 








and 


We have illustrated the general probabilistic problem in medical diag- 
nosis, which is to determine the conditional probabilities 


P(d|I@ foreachj Ef 


those values of j that index the alternative 
disease complexes remaining after the logical-diagnosis test pare re 
been made and where G is the patient’s symptom profile. This a " 
bility clearly depends on the characteristics of the population of patien ; 
involved and on the number of diseases under consideration. For ee 

application, it is important to observe that probability evaluations mie 
at each stage of the logical-diagnosis test iteration will not in any ‘ 
necessarily indicate a trend toward the final diagnosis. For ona 
consider a patient with a disease for which there is a specific test: u 7 
that test is made, the probability of the patient’s having the sprig: i 

actually grow smaller and smaller with each iteration, but eke i e — 
made, this probability will rise to unity! On the other an ; : p 

becomes infeasible to make additional cet the probabilities must play 
i le in the choice of treatment. _ i 
een Bayes’ formula can give important insight i ¥ 
types of ingredients which enter into a probabilistic rien 
disease complex d; and symptom complex s', the formula relates fi 


with P(s‘ld;), as follows: 


where by j G f we mean 


P(d;)P(s'|d;) 
P(dx)P(s'\de) 


allk 


P(d;\s') = 


where > over all & indicates summation over all the possible hss 
plexes under consideration, Let us focus our attention on the two te 
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in the numerator, namely, 

P(d;)P(s‘|dj) 
since the denominator merely acts as a normalization factor. The term 


P(s‘\dj) is the probability of having the symptom complex s‘ when it is 
known that the patient has the particular disease complex d;. This is 
precisely what is described in medical textbooks when a particular disease 
is under consideration. Of course, textbooks do not at the present time 
assign numerical values to P(s‘|d;), but they do state that associated with 
2 certain disease particular symptoms are common, frequent, rare, and 
so forth. The reason that P(s*‘|d;) is discussed rather than P(d;|s‘) is that 
the etiology of the symptoms is related to, or stems from, the disease. 
Thus P(s‘|d;), that is, the symptoms resulting from a disease, 7s generally 
a constant for the particular disease complex under consideration, being 
relatively independent of other circumstantial factors. 

On the other hand, consider the term P(d;). This is the probability 
that, of the particular population of patients involved, any particular 
patient has this disease complex d;. By “particular population of 
patients,” we mean that population from which the patient can be con- 
sidered as chosen at random. For example, the population may be that 
of the entire United States, or that of a particular clinic in a particular 
hospital, ete.; in each case, P(d;) will in general be different. This term 
relates the influence of geographical, seasonal, epidemiological, social, and 
other such factors to the diagnosis. Thus, in the tropics one is more likely 
(o find tropical disease; during an Asian flu epidemic, the symptoms of a 
cold are more likely to be related to Asian flu; some diseases, like polio- 
inyelitis, have a seasonal variation; and so forth. A little reflection indi- 
cites clearly that both these factors—the circumstantial, or local, and the 
constant relating to the disease-symptom syndrome—influence the diag- 
Y Pdi) P (tlds), being a 
i 
normalizing factor, is rarely needed: this comes about because in evalu- 
ating the probabilities of alternative diagnosis it is the relative ratios of 
the probabilities that indicate the most likely diagnosis. 

The significance of Bayes’ formula now becomes clear. Since P(s‘|d;) 
in a constant, dependent only on the pathological-etiological aspects of 
(he particular disease processes associated with d;, tables of these condi- 
(ional probabilities can be made. Then, for any particular population— 
such as a particular clinic, or the patients of a particular physician— 
the total probability P(d;) can be ascertained, as affected by the local 
environmental conditions, and the desired diagnostic probability P (dj\s‘) 
evaluated by the formula, This is, for example, what a physician quali- 
tutively does when he tells a patient over the telephone; ‘Your symp- 


nosis. The denominator of Bayes’ formula, 
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toms of headache, mild fever, cough, general malaise, etc., indicate that 
you probably have Asian flu—many of my patients have it also—it’s 
around our community, you know.” 

As noted above, we are actually interested in P(d;|@), where G@ is the 
patient’s symptom profile —i.e., a Boolean function of the S(i). In this 
case it is easy to see that Bayes’ formula can be expressed as 

iEG 

Y Pa )PCs'ld) 
iEG 
) 5 P(d,)P(s'|dy) 





P(d,\G@) 





where 7 € G over = means that the summation is taken over all symptom 
complexes s' included in G, that is, over all symptom complexes who 
column positions correspond to the position of the units in the design 
tion number of G. The analogous formula for P(f|G), where f is 
Boolean function of the diseases, becomes 

iCG 

> ¥ P@yPo'ld) 
PUG) = ier 
Y Pldv)PCs'ldi) 


all k 





where the j Gf under = indicates the summation over all disease co 
plexes whose column positions correspond to the unit positions in t 
designation number of f. 

If the symptoms can be considered independent (which can only be 
approximation), then we can make use of the fact that in such a case 


PIS@) +--+: S(k)|dj| = P[S(a)\dj] See ee DK P{S(k)|dj] 
and P[S@|d;] = 1 — PIS@Idi) 


where it may be easier to determine P[S(k)|d;] than P(s'\d;). VP 
instance, in such a case we might have ; 


P (ds) P(S(1)|del PIS (2) |ds] 


Y. P(d:)P{S()|aul PLS) [del 
all k 
P(ds){1 — PIS()|dol} PIS@) [dal 


: P(ds) {1 — PS) \de]} P[S(2)|di) 
allk 


P\ds)SQ) - S(2)] = 








and Pld2{SC) - S(2)] = 


The Dynamic Diagnostic-treatment Problem. Suppose that alternat 
diagnoses remained for a patient after the logical analysis, that the pro 
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bilities of these diagnoses have been determined, and that treatment has 
been instituted. Clearly the effectiveness of the treatment on improving 
the patient’s state of health will produce additional information about 
the alternative diagnoses and after a time may require a reevaluation of 
the choice of treatment. Such a treatment-diagnosis cycle, where each 
treatment itself acts like an additional test, is commonly utilized by 
physicians. However, it is the effect of the treatment on the patient’s 
state of health that produces the required information. 





V3'(n) 





1 2 3 4 


ig. 8-18. Illustration of treatment-reevaluation pla:.e for patient with disease com- 
plex d,, starting in health state H;. We assume (to simplify the drawing) that Hi 
in the cured state and the patient will stay well and that H, is the critical state from 
which the patient cannot get well. Heavy lines represent transitions for treatment 
1'\, associated with pi} and v';; light lines represent transitions for treatment T., 
ssociated with p% and vj. The graph indicates that V3"(n) levels off in approxi- 
innately four stages. 


Suppose, for example, that four states of health of the patient can be 
distinguished: Hy, cured; He, good; Hs, poor; H4, critical. Suppose also 
that there are two possible treatments 71 and 7, that can be given for 
each state of health. For each of the possible diseases d, that the patient 
inuy have, let pit and vt! be the transition probability and value, respec- 
lively, of going from state of health H; to state H; under treatment 7%. 
If we are concerned with only two disease complexes, say d; and de, then 
we would have a table similar to Table 8-6, column a, for each disease. 
l’or the moment, let us restrict ourselves to one of the diagnostic alterna- 
lives, say d,, and consider the problem of choosing the “best” of the 
possible courses of treatments to “cure” this disease complex. Of course, 
after a treatment is instituted, the patient’s health state may undergo a 
transition to another state, and the choice of treatment must be further 
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reevaluated, and so forth. The method for determining the choice of 
treatment that will maximize the expected value is dynamic program- 
ming, as discussed in the previous section. The only problem that 
remains is how to choose N, the number of iterations within which the 


TaBLE 8-7. VALUES FOR EXAMPLE IN TEXT 
NN 
Transition probabilities 


Health | Treat- and values 








E . Boundary 
state ment xpectation values 
jaillja2)f7=3 |) 7-4 
A, Vir (N) = i 





.20 | 0.70 0.10 0 
.30 | 0.20 | —0.20| —10 


HH - V2r(N) 


rp, Pi E,! = 0.22 


25 
oo 





ll 
| 
— 








1» pa | 0.40) 0.30] 0.30! 0 ' 
T: i | 9.50 | 0.30 |—0.20)-10 | 2 — 9-18 
ri 
pi | 0.09 | 0.60 | 0.30) 0.01)», _ 
ie v5 | 0.40 | 0.30 | 0 —10 By Oe 
Hy; — f V3"(N) = -8 
2/0.19|0.30| 0.50; 0.01!» _ 
T: x |0.80]0.50| 0 |-10 |*"— ~%-% 


























cure is to be effected. If the patient’s health is initially in state 
the method is to graph V,(n) as the backward (') calculation is acco 
plished (see Fig. 8-18); Vi(m) will almost always rise to an asymptoti 
and when it gets close to the asymptote, the process may be discontin 
and the number of backward steps will be a good value for N. N 


n 1 2 3 4 5 6 7 8 9 10 It 12 = N 





He | TM T. | UT: T T: Ti | To T: T: Te: Tr | Ts 
A, Ti Ty \ Ts Ts T. T. \ Ts T. T. T> T3 ' 


Fic. 8-19. Optimum-treatment choice for the values of Table 8-7. The dashed li 
denote places of change of treatment. 


from Fig. 8-18 that we have assumed that when the patient is cu 
the probability of getting sick again is zero and that when the patient 
in critical health the probability of getting well again is zero. 

As an example of hypothetical probabilities and values, consider ‘T’ 
8-7. Figure 8-19 gives the complete sequence of treatment deci 
calculated from these values. 
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We are now ready to consider the case in which more than one diag- 
nostic alternative remains. The diagnostic possibilities are mutually 
exclusive, and the patient must have one of them, say d,, with probability 
P, calculated by the methods discussed above. For each possible disease 
complex d,, a calculation as described above is made, and for each d,, 
an N,, or number of cycles to plan for, is evaluated. The initial treat- 
ment is considered as being given at this N, for each disease possibility, 





I'1q. 8-20. Illustration of alignment of the d; and ds treatment-value planes for the 
start of the iteration. 


as if the treatment-value planes of Fig. 8-18 for each disease d, were aligned 
(see Fig. 8-20). Now we initiate the treatment-diagnosis iteration by 
computing for each k 


men) =P, [ Bet ¥ vtvr@| 
r all 7 
where for our example, to start with, 7 = 3. The initial treatment of 
choice becomes that 7, for which E*(n) isa maximum. After the treat- 
ment has been instituted for a sufficient period of time to evaluate its 
effectiveness, the transition in the patient’s state of health, to H,, is 
observed. ‘This observation can now be utilized in the sense of another 
towt result for the logical analysis, A reevaluation of the logical analysis 
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in the light of this new test result. may reduce the number of alternative 
possible disease complexes that need be considered. Then the probabili- 
ties of the remaining disease complexes are reevaluated, resulting in a 
new collection of probabilities P,. These probabilities are utilized in 
evaluating, for each treatment 7, 


E#(n +.1) = Dey [He + > piVir(n + 1)| 


all j 


where now r takes on a restricted number of different values. The k 
corresponding to max H,*(n + 1) is taken to indicate the treatment of 
choice, and the iterations continue, as indicated in Fig. 8-21. 









Determine best treatment T 


Observe transition in 
state of health H 


Can patient be discharged from hospital? 
No 


Reevaluate logical analysis 
to possibly reduce number of 
alternative disease complexes d 


Reevaluate probabilities P 


Fig. 821. Flow chart of the dynamic diagnostic-treatment cycle. 






Discharge patient 







The iteration ends when a number of cycles n = min(N,) has beet 
accomplished or when the patient is cured, whichever occurs first. If 
the patient still has not been cured, then the process is reinitiated for t 
remaining disease complexes, just as described before. 


8-6. Inductive Inference: Heuristic Programming 


Meaning of Heuristic Programming. Inductive inference is the proces 
of hypothesis making in a trial-and-error fashion, in order to draw Cone» 
clusions or to solve problems. Whereas deductive inference is relate 
to the mathematical methods currently in common use (as described 
the two previous sections), inductive inference is related to the ingenuil: 
and resourcefulness utilized in originally formulating such mat hematic 
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methods. Inductive inference, then, is the problem-solving process as 
carried out by people, which can be described roughly as follows:t 


The work begins routinely, and then suddenly the person experiences a flash 
of understanding. ‘This is followed by the writing down and checking of the 
solution. Apparently the person first used heuristic methods to look for a solu- 
tion. To each suggestion turned up by the heuristic methods he applies some sort 
of test. The flash of understanding comes when a suggestion gets a high score 
on the test. 


Heuristict programming is the attempt to simulate on a computer 
the processes of inductive inference. It is the attempt to program com- 
puters to solve problems by somewhat the same methods that “intel- 
ligent” beings use. For example, when a man plays chess, he utilizes his 
intuition and experience to select only a certain small collection of all 
possible moves to be carefully examined—rather than determining the 
consequences of, say, all the possibilities two moves ahead (a little less 
than 1 million). A heuristic program utilizes such “intelligent,” or 
skilled, guesses of what is best to examine in the attempt to solve a 
problem. Such a heuristic approach appears to be the most promising 
approach for the programming of computers in problems that are very 
loose-structured, extremely complex, and otherwise sufficiently difficult 
that man requires “intelligence” to solve them. For such problems, 
“good” hypotheses for trial are chosen—using as criteria for “goodness” 
certain available information that may suggest which possible solutions 
should be tried first, or that may rule out whole classes of solutions from 
being tried, or that may provide quick tests for distinguishing likely 
solutions from unlikely ones, and so on. Such criteria are called the 
heuristics of the problem. The field of heuristic programming is con- 
cerned with the investigation and understanding of various aspects of 
heuristics, such as how they are discovered, what kinds there are, what 
heuristics are used by humans, and so forth. 

In the present section we shall illustrate the problems involved in 
heuristic programming by means of the example of proving plane- 
yeometry theorems; on the basis of experience gained in this discussion, 
in the next section we shall describe an attempt at a “general problem- 
solver” program. It should be mentioned that plane geometry does yield 


{| Quoted from H, L. Gelernter and N. Rochester, Intelligent Behavior in Problem- 
solving Machines, JBM J., vol. 2, no. 4, pp. 336-345, October, 1958. The present 
section is mainly based on this article. 

{ In standard usage, the term “heuristic”? applies to discussions which stimulate in 
(heir auditors or readers the desire to investigate the subject further. Machines, 
however, must have this “desire” programmed in, of course, and the term has been 
extended to the process of investigation itself, to cover its empirical and probative 
aapecta, 
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to a “decision algorithm’—i.e., a deductive technique for proving all 
theorems in plane geometry. However, the problem here is not to study 
plane geometry but rather to study how to program the computer to find 
proofs for generating (geometric) theorems, in the same manner that an 
intelligent high-school student would—using ingenuity, coupled with trial 
diagrams, etc., asa “heuristic” for suggesting possible “‘good”’ approaches. 

Proving Geometric Theorems. ‘To understand the meaning of a “proof” 
in geometry, we must first examine geometry from a mathematical point 
of view. To a mathematician, the theorems of geometry are formal 
relations among abstract entities, which basically need have no physical 
reality or significance—but which can be related to “points” and “lines” 
in special pictures if this seems interesting for the purpose at hand. To 
the computer, geometry must be considered as such a formal system, but 
the computer will use the interpretation of geometry as lines and points 
in space as a source for heuristics. A formal mathematical system has 
four ingredients: 

1. A set of primitive symbols 

2. Rules for the formation of well-formed formulas 

3. Axioms 

4. Theorems 

The idea of primitive symbols and the concept of well-formed formulas 
are not new to us and have been used previously in the development of 
the ALGOL and COBOL automatic-programming languages (see Chaps. 
6 and 7). For our geometry, the set of primitive symbols comprise the 
characters that represent names of points and lines (considered as unde= 
fined abstract entities) and the symbols that represent relations between 
the points and lines. The rules specify how to assemble the primitive 
symbols into well-formed formulas that are to have “meaning” in geome 
etry, whether they be true or false. For instance, “Three points always 
lie on a straight line” is a well-formed formula (but false), wheren# 
“Sides triangle of parallel every” is not well formed. 

An axiom is a well-formed formula, such as “Two points can lie on on 
and only one straight line,”’ which is regarded as being true by definition, 
From a set of axioms, new well-formed formulas can be derived as bein 
true by means of “rules of inference,” or laws of logical reasoning. Also, 
new well-formed formulas can similarly be derived (as being true) from 
previously established well-formed formulas. A proof is a sequence of 
well-formed formulas, in which each formula (or line of proof) eith 
follows from preceding formulas by a rule of inference or else is itself a 
axiom or previously established theorem. A theorem is the last line in 
proof. 

Thus, in order for the computer to establish the proof of a theorem, 
must produce a sequence of well-formed formulas whose last formula 
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the theorem and whose first formula is an axiom or previously established 
theorem and in which every other formula of the sequence is either deriva- 
ble from preceding formulas or is an axiom or previously established 
theorem. It must be pointed out that the finding of such a proof, or 
sequence of well-formed formulas, is quite different from the checking of a 
given proof. For checking a proof, each successive line of the proof 
would be examined in relation to the previous lines, to determine which 
law of logic was used for its derivation. This should be a relatively 
straightforward problem to program for a computer: each formula, or 
line of proof, would be a list of characters or words in the computer’s 
memory, and the laws of logic would be subroutines. We shall assume 
that such a proof-checking routine has already been written for our 
computer. 

On the other hand, the process of finding a proof requires ingenuity 
and imagination, which are used by our intelligent high-school student. 
We might give our high-school student some helpful hints, such as 
‘If you are trying to prove that two lines are parallel, try using the 
theorem ‘If the opposite interior angles, formed by the intersection of two 
lines with a third, are equal, then the two lines are parallel.’’’? Drawing 
a picture might help suggest a method of approach to finding the proof. 
Such a figure might give insight into the problem. These are heuristic 
aids, and we shall now consider how a computer can be programmed to 
utilize such heuristic aids. 

The Heuristic-programming Technique. In general, human beings do 
not solve problems directly but break them down into easier subproblems 
whose totality is equivalent to the original problem. This breaking down 
of a problem is a regression technique, which, of course, may not lead to a 
solution at all! In fact, this is one of the characteristics of the heuristic 
method—there is no guarantee that a solution will actually be obtained. 
‘To illustrate the point, consider the following argument :t 


I want to take my son to nursery school. What's the difference between what 
| have and what I want? One of distance. What changes distance? My 
automobile. My automobile won’t work. What’s needed to make it work? 
\ new battery. What has new batteries? An auto repair shop. I want the 
repair shop to put in a new battery; but the shop doesn’t know I need one. 
What is the difficulty? One of communication. What allows communication? 
A telephone. . . . 


rom the original goal, such an analysis forms subgoals, functions that 
need to be performed, and operations to perform them. Tor instance, 
lot Go be the goal established by the problem of formulating a geometric 


{ From A, Newell, J, ©. Shaw, and H. A, Simon, Report on a General Problem- 
wolving Program, Paper P-1584, pp, 8), Rand Corp, Santa Monica, Calif., 1959. 
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proof—that is, Go is the formal statement or theorem to be proved. If 
G, is another formal statement from which Go can be inferred, then Gi 
is called a subgoal. In general, if G;_1 can be inferred from G;, then G; 
is a subgoal of order 7 and all G; for j > 7 are called higher subgoals than 
G,. The problem-solving graph of Fig. 8-22 summarizes the regression of 


Ge 
a 
Gy G} 
0 1 2 3 4 
G2 G2 G2 Go Go 
oe we be nf \e | / Ne 
1 2: 3 4 Gé Gs 7 Gé i 


Pj 
G? G3 G3 G3 t f \ AK { 
Fig. 822. Problem-solving graph illustrating a structure of subgoals G,« and trans: 
formation operators P18. 


subgoals; the G;* represent subgoals of order 7, with a numbering the su 
goals of a given order, and each P;** is an operator that can transform i 
given G,* into G®_,. The geometric problem is solved when any G; oa 
be obtained from the hypotheses and axioms. 

The heuristics are utilized in developing the problem-solving gra 
during the heuristic processing. There are in general two types 
heuristics, syntactic heuristics and semantic heuristics—named for the ty 
of entity examined when the heuristic is being applied. We shall fi 
illustrate the concept of a syntactic heuristic in geometry, as introduce 

by the following example. Su 


7 pose we are to prove the follow 
ing theorem: 
The diagonals of a parallelogram 
bisect each other. 
From Fig. 8-23 we see that 16 
prove the theorem we miu) 
A D establish the subgoals 


Fia. 8-23. Heuristic-aid figure for proving 
that the diagonals of a parallelogram 
bisect each other. 


Go: AK = KC 


and Gy': BE = ED 
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The computer should be able to recognize, as people usually do, that the 
proof of the right-hand formula is essentially the same as that for the left- 
hand formula, and that therefore only one of these need be established. 
Similarly, we would not want the computer, after starting out to prove 
that AAEB & ADEC, to set up the subgoal of proving that 


AAED = ABEC 


These requirements are met if the syntactic heuristic of symmetry is 
utilized. 

Let 7 be a permutation on the names of the syntactic variables in a 
theorem; then 7 is called a syntactic symmetry of the theorem if its 
operation on the hypotheses leaves them unchanged except for a possible 
transformation into an equivalent form with respect to the symmetries 
of the symbols. If {H} is a set of hypotheses and 7 is a syntactical sym- 
metry for a theorem, then 

w\|H} = {H} 


We can now state in terms of z the important observation upon which the 
syntactic heuristic of symmetry is based: Jf T zs a well-formed formula 
provable from {H}, then xV ts a well-formed formula provable from {H}. 
‘The proof of this observation is as follows: 


If {H} 57 
then r{H} > aT 


is true 

because renaming the syntactic variables 
in a theorem does not destroy the validity 
of the theorem 

by hypothesis 

is true, as desired 


But 7{H} = {H} 
so that {H}— oI 


The application of our observation as a heuristic in aiding the selection 
of appropriate subgoals to a problem is as follows: G; is not a suitable 
subgoal if it is syntactically symmetric with any other subgoal G; for 7 < 2 
(since the same proof for G; would hold for G; with only a permutation of 
syntactical variables). For example, consider the problem of Fig. 8-23; 
the computer will eliminate Go! from consideration since 


1G o! = G)° 


where w transforms A into B, B into C, C into D, and D into A. Hence 
after proving Go? the computer will merely assert ‘‘similarly Go!.”” Also, 
if at some point AALB = ADEC is a subgoal, the heuristic of symmetry 
will eliminate AAHD & ABEC as a possible subgoal, and so forth. 

Such syntactical heuristics will be essential to the success of the 
goometric-theorem-proving program, to eliminate redundant paths and 
present an efficient problem-solving graph, However, such syntactical 
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heuristics do not tell which path the computer is to try on the graph; 
for this information, semantic heuristics are required. For example, 


one of the first heuristics that should be applied for proving geometri¢ 
theorems with the aid of diagrams is the following: 


If the diagram consists of a “bare” simple polygon, a construction will 


probably be required. 


If a construction is indicated, a heuristic about which construction to 
make might be this: 


If the figure has an axis of symmetry and it rs not drawn, then draw i, 
Applying such semantic heuristics to our problem, we would draw the 
lines AC and BD intersecting at HE. Another useful heuristic will be this? 


If the theorem asks that two line segments or angles be proved equal 
determine by measuring whether these are corresponding parts of apparenll 
congruent triangles. If so, attempt to prove the congruence. 


Applying this heuristic we would develop the subgoal to prove tha 
AAEB = ADEC. Another frequently used heuristic is this: 


4 

To prove two angles are equal, see whether they are interior opposite angl 

formed by a transversal of two parallel lines or whether they are oppost 
angles formed by two intersecting lines. 


Applying this heuristic, we can set as subgoals that “BAKE = } 
and x AEB = XDEC, which would show that AAHB= ADEC. 


Finally, to prove that AE = EC, the subg 


AB = DC would be set up; but this follows from t 


hypothesis of the theorem and hence the proof 
constructed. 
routine 
Fig. 8-24. Block 


Programming Organization. The program for pr' 
ing geometric theorems, constructed by H. 
Gelernter and N. Rochester, has three parts, a sym 

diagram of the pro- 

gram to prove geo- 

metric theorems. 











Syntax 
routine 


routine, a diagram (or semantic) routine, and a heurt 
routine (see Fig. 8-24). The syntax routine forms 
keeps track of the lists of syntactical entities that co 
prise the lines of the proof and also checks the hew 
tically suggested trials. The diagram, or semant 
routine makes constructions and measures them (by means of anal 
geometry); its output is not the values of the measurements but rat 
only qualitative information of the type acquired by a mathemati 
scanning a well-drawn figure. The heuristic routine is the boss, #0 
speak; it decides what to do next, directing the diagram routine to 
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form certain constructions and the s i i 
yntax routine to test | 
and so forth. Beers 


8-7. Inductive Inference: The General Problem-solving Program 


Concept of a General Problem-solving Program. The fact that a heuristic 
program basically operates on syntactical collections, or formulas, to 
obtain new formulas (e.g., operator P;“* transforms G2 into G?.,) reiniida 
one of our translator of Sec. 8-2, which also operated Ga ayataehioat 
entities. Of course, the translator involved no heuristic considerations 
But it did embody those concepts that related to programming-langua : 
translation in general, and operated upon those entities that related is 
the source or target language in particular; and hence it could be applied 
to any source-to-target-language translation. The question immediately 
arises of whether, in a similar manner, the general aspects of a heuristic 
program could be separated from the particular and a general problem- 
solving program constructed that would handle many different heuristic- 
programming problems when provided with the particular information 
required for a particular problem. A. Newell, J. C. Shaw, and H. A 
Simon have indeed suggested such a general problem solver which ie the 
topic of this section. f 
The general problem-solving program is based on the results of an 
analysis of heuristics in which three general heuristic types are distin- 
guished. Each such type of heuristic is associated with a goal-related 
(ransformation type. We now describe these transformation heuristic- 
type combinations (see Fig. 8-25). The first goal-related transformation 
{ ype is concerned with transforming the subgoal form G; under consider- 
ition at any stage of the problem into the desired final goal form Gy 
lor example, in the process of demonstrating the trigonometric identity 


(tan 6 + cot 6) sin @ cos 6 = 1 


1 subgoal might be to transform 


| G;: Gata + cot ) sin 6 cos 6 
into Go: 1 

The heuristic for this first type is to determine, if possible, the difference 
between G; and Gy and then reduce this difference (see Fig. 8-25 heuristic a) 

he difference between a G; and Gy could be, for instance diferentes in 
the variables that appear in each (called AV differences): or differences 
in the number of times a variable occurs in each (called AN differences) 

and so forth, For our illustration both AV and AN differences exist. 


{| See dbid, The techniques and examples of this section are based on that report. 


































































































CC .....#eeeweeeeeeeeeeeeeeeeee 


356 AUTOMATIC-PROGRAMMING LANGUAGES [Cuap. 8 


Reducing the difference between goal forms is a goal-related a 
mation of the second type, and leads to the heuristic of the wee? ype 
(see Fig. 8-25, heuristic ). The difference is reduced by seare Hy ig 
an operator Q that can reduce the kind of difference in question a = ; 
applying the operator to G,;. For example, consider the operator 


Heuristic a: Transform a into form b 













Reduce difference d between 
aand b (obtaining c) by 
means of heuristic 6 


Match a with b to 
determine difference 
d between a and b 









Start with 
a=Gn; 
otherwise 
a= G; 












Heuristic Heuristic fails 


b= { Go m 
(Q); see heuristic ¢ Seale 


Heuristic 6: Reduce difference d between a and 6, obtaining modified form c 


Try for new operator 
Apply Q to a by 
means of heuristic c 















Succeed 





Search for relevant 
operator Q 


Enter from 


c=G. 
heuristic a 





Heuristic fails 


Heuristic c: Apply operator Q to a, forming c 




















Succeed _ | Generate resulting 


Transform a into L(Q) formc from R(Q) 


by means of heuristic a 
Fail 


Enter from 
heuristic 5 






Heuristic fails 


i isties of the general probl 
95. Block diagrams of the three types of heuristics 0 I 
Oe oma Gr is the hypothesis, or starting form; G, is the desired final fo 
and G; is a subgoal form. 


turns cot x into cos 2/sin «,; when applied to our illustrative G;, the re 


—+ = 5) sin 6 cos 0 
cos 6/sin@ ~~ sin 8 


which has only two different variable forms (sin 6 and cos 0) naa 
the three (sin 8, cos 0, and cot 9) of the original Gi. Hence the diffe 
AV between G; and G» has been reduced by the application of the ora 
Actually applying the operator to G, requires a heuristic . h “a 
type (see Fig. 8-25, heuristic c). An operator Q has two sides, 


becomes 
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the tail of the arrow and one at the tip of the arrow, denoted by L(Q) 
and R(Q), respectively. For instance, if Q is the operator that turns 
cot x into cos x/sin 2, it is written as 


COS & 
cot rt 
sin x 





where L(Q) = cot x and R(Q) = cos x/sin x. A more complicated oper- 
ator might be 


sin? z + cos*z— 1 


In some cases, it might not be possible to apply Q, because G; may not 
be in the appropriate form. In such a case, a goal-related transforma- 
tion of the first type is required, except that now the difference between 
(’, and L(Q) is to be reduced. In any event, a substitution of variables is 
usually required (for example, x for @), and hence when R(Q) is obtained, 
i replacement of variables results in G,_1. 

In order to apply the three types of heuristics of the general problem- 
solving program to a particular problem, certain particular information 
must be given concerning the nature of the structures with which the 
problem is concerned. This is analogous to the particular syntactical 
und semantic description of the object language in terms of the target 
lunguage required in order to apply the translator program in a particular 
case. For the general problem-solving program, the following informa- 
tion must be supplied: (1) the syntactical description of well-formed 
formulas; (2) a list of the available operators [of the form L(Q) — R(Q)]; 
(3) definitions of the kinds of differences that may exist; and (4) a table 
of the connections between these differences and the operators capable of 
producing them [although such a tabulation could be derived from (2) 
and (3)]. In addition, the heuristics must be applied in such a way that 
progress toward the goal is made; hence we must also be given (5) criteria 
for evaluating relative progress toward the goal among different difference 
reductions. We shall now present an example of the use of the program 
(o prove trigonometric identities in the manner of an ‘“‘intelligent’’ high- 
achool student. ‘ 

Example of Proving Trigonometric Identities. As an example of the 
method, consider a problem in trigonometry. Tirst, we must state the 
required information concerning trigonometry, which is as follows: 

1, SYNTACTICAL ENTITIES. Ordinary algebraic expressions, including 
the trigonometric functions. The associative and commutative laws are 
considered as being implicit in the notation: the program can select freely 
which terms to use in an expression like «+ y+ 2. The syntactical 

definitions are similar to those given above (in Sec, 8-3) for the simple 
arithmetic expression but also include the trigonometric functions. 
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2. oppRATORS (a double arrow means the transformation works both 
ways) 
Name Definition 


AO These are recursively defined to be applied from the innermost subexpressions 
to the main expression: 


A+(B+C)7A+B4+C AA-— A? 


A(BC) > ABC ABAC— ABC 
A+004 A°— | 
A+A>2A 04— 0 
A-A->0 AloA 

0A—> 0 (ABC ABC 
1A A (AB)o > AcBe 


Al (A — B)(A + B) & A? — B? 
A2 (B+C)AGBA+CA 
AZ (A +B)? A? + 2AB + B? 











TI tan ro aba 
T2 (tan x)(cot 2) 1 
: sin x 
T3 tanz<o ares 
cos £ 
TA cot xO uate 
T5 sin?a + cos? 2 ol 


3. DIFFERENCES (trigonometric functions are detected by AV and A 
Name Description 


AV ‘A variable appears in one expression that does not appear in the other. 
instance, sin? a is said to differ from sin x tan x by +V since it ne 
tan x; A + cos a is said to differ from A by —V since it needs to lose 
COS &. ; ; 

AC There is a difference in the binary connective. For instance, sin ¢ + ti 
differs from sin x tan x by AC. : 

AN A variable occurs different numbers of times in the two expressions, 
instance, sin « + cos x is said to differ from sin? z + cos x by +N, sit 
needs another sin x; sin? x is said to differ from sin « by —N, since it 
to reduce the number of occurrences of sin «. . 

AT There is a difference in the sign (+ or —) of the two expressions. 


4. TABLE OF CONNECTIONS BETWEEN DIFFERENCES AND OPERATORS 


AO Al A2 ABZ T1 T2 T3 T4 T5 








The +, —, or V in the table means that the operator of the col 
affects the difference of the row, as specified; for double-arrowed 
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ators the differences specified correspond to the respective directions indi- 
cated under the operator name. The ¢ means that the operator should be 
accepted for a ‘‘—” difference if R(Q) contains functions which already 
occur in the expression (or subexpression) and also that it should be 
accepted for a “+” difference if R(Q) contains functions which do not 
occur in the expression (or subexpression). 

5. CRITERIA OF PROGRESS. All differences in subexpressions are less 
important than differences in expressions. For a pair of expressions the 
differences are ranked thus, from most to least important: +V, —V, 
AC, +N, —N, AT. For instance, in sin x cos x versus sin? x + cos 2, 
AC is more important than +N; and in sin x (cos x + cot x) versus 
sin « + (cos x + sin x), —V is more important than AC. 

Having presented the required information, we are now ready to tackle 
aun example. Consider the proof of 


(tan 6+ cot @) sin 6 cos @ = 1 


We shall utilize the block diagrams of Fig. 8-25, starting with heuristic a, 
with G, (which is essentially the hypothesis) being 


(tan 6 + cot 6) sin 6 cos 6 


und Go being 1. The work follows; the student should follow in detail 
the utilization of Fig. 8-25 as the steps progress. Each step in the heu- 
ristic process is labeled in order with an Ha; the subgoals are labeled 
hy Gy: as they are generated. The indention of the lines follows the 
recursive use of the block diagrams of Fig. 8-25 for the heuristic types: 


(iiven Ga: (tan 6 + cot 4) sin @ cos 6 
Vinal goal Go: 1 


H/\. (Try to) transform G, into Go: match produces difference — V (tan @). 
H2. Reduce — V between G;, and Go: first operator found is AQ. 
H3. Apply AO to G,: no change produced; so find next operator 
PA 
HA. Apply T1 to Gh. 
H5. Transform G, into L(T1): succeeds for subexpression 
tan 6, producing 


G1: (1/cot 6 + cot 6) sin 6 cos 6 
H6. Transform G,_1 into Go: match produces difference — V (cot @). 
H7. Reduce —V between G,_1 and Go: first operator found is A0. 
H8. Apply AO to Gy-1: no change produced; so find next 
operator 7'4, 
H9, Apply T4 to Ghou. 
HA0, Transform G,_, into L(7'4): succeeds, producing 


G9: |1/ (cos 0/sin 0) + cow 0/sin 0] sin @ cos 0 
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H11. Transform G2 into Go: match produces difference 
—V (cos 4). 

H12. Reduce — V between Gr_2 and Go: first operator found 

is AO. 

H13. Apply AO to Gi—2: transform} 1/(cos 6/sin @) into 

sin 6/cos 6, producing 


Gy_3: (sin 6/cos 6 + cos 6/sin 6) sin 6 cos 6 


H14, Transform G3 into Go: match produces differen 
—V (cos 8). 
H15. Reduce —V between Gy_3 and Go: first operator foun 
is AO. 
H16. Apply AO to Gis: no change produced; find ne 
operator T5. 
H17. Apply 75 to Gi-s. 
H18. Transform G,_3 into L(7'5): match produ 
connective difference AC (- versus +). 
H19. Reduce AC between Gr-s and L(T5 
first operator found is AO. 

H20. Apply AO to G,-s: no change p 
duced; find next operator Al. 

H2\. Apply Al to Gr-s. 
H22. Transform G,-3 into L(Al 
match produces difference 4! 
H23. Reduce AC between @ 
and L(A1): reject beca 
difference is no easier t 
difference in H19; that 
no progress is being m, 
according to the crite 
Find next operator A2, 
H24. Apply A2 to Gh-s. 
H25. Transform G,—; into L(A 
match succeeds with A 
sin 6 cos 6, B = sin 0/co# 
and C = cos 6/sin 6, produel 


G,_4: (sin 6/cos 6) sin 6 cos 0 + 

(cos 6/sin 8) sin 6 com 

H26. Transform G4 into L(75): mea 
produces difference AC in left-h 
expressions. 
+ AO can be used, since 


con O/sin 0 = [(cos 6) (sin 0)~']>! = (cos 0)" {(ain 0)~'}>! = (cow 0)> sin 0 
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H27. Reduce AC between left expression 

of G,_4 and left expression of L(75). 

H28. Apply AO to left expression of 
Gi_4, obtaining t 


Gis: sin? 6 + (cos 6/sin 6) sin 6 cos 6 


H29. Transform G,_5; into L(75): dif- 


ference AC exists in right-hand 
expressions. 


H30. Reduce AC between right ex- 
pression of G,_5 and right ex- 
pression of L(7'5). 

H31. Apply AO to right expres- 
sion of G5: succeeds, 
obtaining 

Gre: sin? 6 + cos? 6 


H32. Transform G5 into L(7T5): 
succeeds, resulting in 


Gn_z: 1 


H33. Transform G,_; into Go: identical, and proof is 
concluded. 


Thus the lines of the proof become 
G;: (tan @ + cot 6) sin 6 cos 6 
1 
Gs: (a F + cot ) sin 6 cos @ 


1 cos 6 
Gs: : 
"> cos 6/sin 6 + sin 6 


in@ . cosé 
q,: (32 . 
. (= 0 + sin 6 ead 








sin 6 cos 6 








GQ, in 8 : 
73 cos § Sn 8 cos 6 + 





cosé@ . 
ane sin @ cos 6 
cos 0 
sin 6 
G,: sin? @ + cos? 6 
Go: 1 





Go: sin? 6 + sin 6 cos 6 


where we have let h = 7 in Gy_i. 


| Using AO, we find 


sin 0. , } 
et sin 0c @ @ sin @4in 0 (cos 0)~' con 6 = gin? 6 (cos 0)~!(cos 6)! 


@ win? 0 (con 0)° @ (win® 0)(1) @ ain! 0 
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The Heuristic Method. Many additional features can be incorporated 
into the heuristic method. The concept of planning, or constructing a 
solution in general terms before working out the details, can be incorpo- 
rated into the method. Planning can be accomplished by (1) generalizing 
the forms by omitting certain details of the original variables and con- 
nectives, (2) formulating the corresponding problem in terms of the gen- 
eralized forms, (3) solving the generalized problem, where this solution is 
to be considered as the plan for solving the original problem, and (4) 
translating the plan back into the nongeneralized forms and utilizing it 
for solving the original problem. The details are beyond the scope of 
the present treatment. Another important concept that can be included 
in the heuristic method is that of learning, for certainly a human problem 
solver becomes more skilled at solving a particular class of problems as 
he gains more experience in practice. Learning can be accomplished by 
enabling past history (1) to improve the set of differences that are utilized 
and (2) to improve the process of selecting the operators utilized to 
remove the differences, by adjusting the table of connections. These 
and many other additional features of the heuristic method are the 
subject of a great deal of current research. 

In conclusion, we might note how our use of the term ‘inductive 
inference,” as illustrated in this and the previous section, falls short of 
other, more general connotations of the word.+ For we could includ 
under “inductive inference” the discovery of regularities of an environ 
ment, or the formulation of the laws that govern an environment, or t 
prediction of the likely consequences of an action in an environment 
where the process of ‘‘discovery,” “formulation,” or “prediction” woul 
depend on the computer’s inductive capabilities as it “lived in” an 
reacted with the environment in question. 


8-8. Abstraction and Creativity 


Introduction. In this section, we shall avoid as far as is possible d 
cussions of philosophy, epistemology, aesthetics, and other such dialee 
topics, although we must admit that these topics are closely related 
our subject of computer programming to achieve the ‘‘intelligent” ca 
bilities of abstraction and creativity. Our purpose is merely to introdit 
briefly some of the methods of approach that could be taken in using t 
computer for attempting to achieve artificial intelligence in these are 
Consideration of abstraction falls naturally into two categories: 
first is associated with the problem of identifying or recognizing & Cone 
entity as belonging to, or being an element of, a particular abstract 


t See, for example, M. Minsky, Steps toward Artificial Intelligence, Proc. IRB, 
49, no. 1, p. 27, January, 1961. 
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conceptual class. For example, how does one recognize a certain animal 
as a “dog,” or a combination of symbols as a “simple arithmetic expres- 
sion,” or a pictorial figure as the letter “A,” and so forth? The second a 
concerned with the problem of originally formulating the abstraction or 
concept from a plethora of experiences. For instance, how does one 
decide in the first place that animals can be classedt as does ” “cows,” 
ete., or that there is an entity “simple arithmetic éxpressinn? ayhich cat 
be ueetuly distinguished from other mathematical forms, or that “rec- 
Bei ae  etc., are to be considered as distinguishable concepts, 
Programming to achieve creativity, as in music, graphic art, or mathe- 
matics, is closely related to the process of abstraction and will be intro- 
duced through the example of musical composition. In this section 
however, all our discussions will be found inadequate and tettainly 
incomplete ; but as yet the analysis of the problems involved in pro- 
gramming computers to achieve abstraction and creativity has been 
incomplete and is the topic of vigorous current research. If the ideas 
presented in this section stimulate further discussion, if they raise more 
questions than they answer, if they create an awareness of the patente 
ilities of computer capabilities, then we shall have fulfilled our purpose 
Deductive Methods in Abstraction. Consider the first problem ‘a 
abstraction : that of identifying an object as a concrete representation 
of an already understood concept, sometimes referred to as pattern recog- 
nition, Such an identification can be made through either deductive or 
inductive inference, as described in the previous sections. For example 
deductive inference in the form of the logical analysis of Sec. 8-5 fased 
there to identify a disease entity) may be interpreted as a general method 
for identifying an object as belonging to a class according to the con- 
figuration of its characteristics. To carry this interpretation further, the 
result of such attempted identification may involve only a probabilistic 
outcome, in which the object may be considered as a concrete repre- 
sentation of more than one concept. In such a case, further information 
concerning the object would be gathered and another attempt at identifi- 
cation would be made, continuing in such a recursive manner until a 
unique recognition is accomplished. 
. Alternatively, identification by means of deductive inference can 
involve a syntactical-type analysis. or example, the simple arithmetic 
expressions included in a collection of arithmetic symbols can be recog- 
nized through the application to the collection of the translator routine 
(I’igs. 8-4 and 8-5) in conjunction with the syntax of Table 8-4. In the 
syntax given there, the only syntactical relation required was juxta- 


| Nontaxonomically, that is! 
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TaBLE 8-8. SYNTACTICAL RELATIONS FOR PIcTORIAL EXPRESSIONS 


is above 






Representing 










is next to 





is inside of 





is inside on the bottom of 





rests on top of 





rests next to 


position, indicated by adjacency, e.g., (letter)(letter) for AB. Additional 


syntactical relations can enable the syntactical definitions to be formed 
of, for example, pictorial figures. As an illustration, consider the reli 
tions of Table 8-8. In terms of such relations we can syntacticall 

define a four-year-old boy’s concept of a “‘house”’ as follows: 


(door) ::= U 
(windows) ::= & |(windows) > & 
(chimney) ::= ©|9 


(wall) ::= [_]|(door) © [] Ktwindows) © [_] 
(gable) :: = ZA |(chimney) T AA 

(roof) ::= \_\ (chimney) tT \_\ 

(front view) ::= (gable) [ (wall) 

(side view) :: = (roof) T (wall) 


(house) :: = (front view)|(house)(side view) 


Such a description will recognize the pictures of Fig. 8-26a as how 
but will not so identify the pictures of Fig. 8-26b. With this notati 
each of the pictures of Fig. 8-26a can be written in terms of a piel 
expression, respectively as follows: 
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Ato 

(stA)t (© OD) 

[At (© 2) +[(et\\}t (OL) 

[At (a OF) = {fat Ait(@ ©) [Ate +s) ©} 


Inductive Methods in Abstraction. Concept identification, or pattern 
recognition, can also be accomplished through inductive inference by 
means of heuristic programming, which has been described in the previous 
sections. The problem then becomes one of eliminating the “difference” 











lia. 8-26. (a) Pictures recognized as houses; (b) pictures not identified as houses. 


between the given object and the idealized concept of the abstraction; 
if the difference can be completely reduced, the identification will have 
been made. In order to give an example, we must first define the five 
parts of the subject matter upon which the heuristic methods of Fig. 
8-25 are to work. Suppose these are as follows: 


1. opsects. Pictorial expressions as defined above. 
2, OPERATORS 


| B— BB 

Pr JPOCI> CL] 
rm ®®LJ> EC] 
po of VAS LA 


. DIFFERENCES, Only AV and AN occur, and are defined as in 
Hoo, 8-7, 


P \\tLOo= LI 
re tA>A 
pr? (ALO)-OPAIO 


~ 
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4. CONNECTIONS. ¢ means accept if R(Q) contains figures that also 
occur in the pictorial expression: 
Pi P2 P3 P4 P5 P6 PT 


rs 


AV t @ ¢ ¢ ft 
AN t t 


5. CRITERIA OF PROGRESS. Defined as in Sec. 8-7 (for remaining 


difference). 

To identify a pictorial expression as a “house,” we shall choose 
Z\. |] as Go, the idealized concept house, and try to reduce to nothing 
the difference between the given expression G, and this Go: if we are Suc 
cessful, the given expression is a house; if not successful, it is not a house. 
(The “ideal” house is the left diagram in Fig. 8-26a.) Consider as an 


illustration the expression [ At (0 0) (0 t\)t (@ ©L))| i 
which is the second diagram from the right in Fig. 8-26a. The work 
proceeds as follows: 


a: [At Go DY-1et)t oD) 
@: AtO 


H1. Transform G;, into Go. 2 
H2. Reduce AV between G, and Gp (1) : first operator 1s P2 


H3. Apply P2 to Gs, finding 
Ga: (At OO) —[ot XteOD)} 


H4. Transform Gy_1 into Go. ; 
H5. Reduce AV between Gj_1 and G (0): first operator 18 
H6. Apply P2 to G,-1: no change produced; next opera 
is P3. 
H7. Apply P3 to G,_1: no change produced; next opera 
is PA, 
H8. Apply P4 to Gr-1, finding 


(At O)-[At(?OD) 


H9. Transform Gy_2 into Go. 
H10. Reduce AV between G,-2 and Go (#8): first opera 
. is P2. 
H11. Apply P2 to G,-2: no change produced; ni 
operator is P3. 
H12. Apply P3 to Gy~2, finding 


Gis! (A t LC) -((Al C_)) 
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H13. Transform G),_3 into Go. 
H14. Reduce AV between G;_3 and Go: first operator 
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is P2. 

H15. Apply P2 to G,-3: no change produced; 
try P3. 

H16. Apply P3 to G3: no change produced; 
try PA. 

H17. Apply P4 to Gi-3: no change produced; 
try P5. 

H18. Apply P5 to Gy_3, finding 


@« (AtO)- 


19. Transform G)_4 into Go. 
H20. Reduce AN between G;_4and G) ({__]): first 
operator P7. 
H21. Apply P7 to Gy-4, finding 


Gis: At L] 


H22. Transform G,_5 into Go: identical, and hence 
G, is identified as a house. 


One of the most interesting aspects of this formulation of the identifi- 
cation of abstractions, or pattern recognition, is the speculation that this 
is the method most closely related to brain mechanisms. For if one asks, 
say, a four-year-old child to draw a house, he will draw his idealized con- 
ception of a house; this in all probability will actually look like the left- 
hand diagram in Fig. 8-26a. Thus, according to this theory, the child 
recognizes a house by reducing it to his conceptual idealization. Such a 
speculation will explain why a child, when asked to draw a table, will 
ulmost always display four legs and a flat top, completely out of per- 
spective; this is the idealized conception of a table as syntactically 
recorded in the child’s brain! Further discussion of these fascinating 
ideas is, unfortunately, beyond the scope of the present treatment. 

Formulation of Abstractions. The second aspect of abstraction is 
associated with the original formulation of the abstraction, or concept, 
itself. The above study of the process of identifying an object as a 
representation of an abstraction, or concept, can shed much light upon 
this formulation process. Again we can distinguish between concept 
formulation through the utilization of deductive inference and concept 
formulation through the use of inductive inference. In the deductive- 
inference case, for example, we could list the characteristics of a repre- 
sentative sample of the population under consideration, for instance as in 
Table 8-9, For concreteness, suppose the columns represent medical 
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TABLE 8-9. REPRESENTATIVE SAMPLE OF THE POPULATION 
P, P., Ps; Ps Ps Pe Pa Ps Py Pio Pui Pr 
aa a ee reer 
S(1) 1 1 1 1 1 0 0 0 0 0 0 0 
S(2) 0 0 0 0 0 1 1 1 1 1 1 1 
S(3) 0 0 0 0 0 0 1 0 0 0 1 0 
S(4) 1 1 1 0 1 0 0 0 0 1 1 1 
S(5) 0 1 0 0 1 1 1 1 1 0 0 0 
S(6) 0 0 1 0 0 0 0 1 0 +90 0 0 
S(7) 0 0 0 1 0 0 0 0 0 0 0 1 
S(8) 0 0 0 0 0 1 421 is ok 0 0 0 
S(Q9) 0 0 0 0 0 0 0 0 1 1 1 1 
S(10) 0 0 0 0 1 0 0 0 0 1 1 1 


patients and the rows symptoms, as in Sec. 8-5, a 1 indicating that the 
symptoms are present, a 0 that it is absent. In order to determine the 
different diseases that may be involved, i.e., to classify the patients 
according to more or less “typical” symptom combinations—which is to 
say, to formulate abstractions—we construct Table 8-10, which gives the 
number of different corresponding elements Aj;,; that occur in each pair of 
columns, For instance, columns 1 and 2 differ only in row 5, so that 
Ai.2 = 1, while columns 1 and 9 differ in rows 1, 2, 4, 5, 8, and 9, an 
Ai = 6, etc. The table is of course symmetric: that is, Ay; = Aiz The 
A:; can be considered as the distances between pairs of columns. Noté 
that in Table 8-10 we have drawn lines demarcating the three groups 


TaBLE 8-10. Numper oF DirreRING SYMPTOMS FOR Eacu Parr or CoLUMNS 
or TABLE 8-9 












































12 3 45 6 7 8 9 | 10 11 12 
1 01122 5 6 6 6 45 5 
2 1023 1) 4 5 5 5 5 6 6 
3 1203 3 ])675 7 5 6 6 
4 23 30 4 5 6 6 6 6 7 5 
5 213 40/5 6 6 6 4 5 5 
6 | 5 4 6 5 5 ae ie ie OO a ae 
7 6 5 7 6 6 102 2 |} 6 5 7 
8 6 Bb be 6 BB Or Dh 26.7 Fe 
9 65 7 6 6 1220 {4 5 5 
10 45 5 6 4 5 664 /]|01 1 
11 5 6 6 7 5 6575/10: 
12 5 6 6 5 5 6 77 DBL aaeo 


closely clustered columns, namely, columns 1 through 5, 6 through 9, 
10 through 12. Within the first group, the most ‘typical’ column 
column 1, since within the group 1 to 5 it is closest to all the oth 
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Similarly, column 6 is most typical within the group 6 to 9, and column 10 
is most typical within the group 10 to 12. Thus our abstract formulation 
resulted in three diseases, for which columns 1, 6, and 10, respectively, 
present the most ‘typical’? symptom combinations. Such a system of 
abstraction formulations has actually been utilized in taxonomy, to assist 
in formulating taxonomic classifications; it has been suggested for use 
as illustrated in the determination of new diseases. After the classifi- 
cations have been formulated, further analysis might result in the elimi- 
nation of both redundant and irrelevant symptoms or characteristics from 
consideration. f 

In all this we have assumed that the method leads to a solution (i.e., 
that a solution exists), which is reasonable for the definition of disease 
entities arising from different etiologies or for the definition of taxonomic 
categories that arise from different evolutionary branches. That is, we 
have assumed that there exists a ‘natural’ solution, due to a basic 
underlying population structure, in which combinations of characteristics 
should represent an abstract, or conceptual, entity. However, it is 
probably more generally true that abstractions, or concepts, are formed 
by people with “usefulness” as the primary criteria. In such a case past 
experience—namely, the human heritage called civilization—will have 
already defined a large majority of the most useful concepts, and the 
problem changes from one of finding new abstractions to one of learning 
already recognized abstractions. For our purposes, we desire to teach 
the computer to recognize abstractions. 

Much of the research accomplished in this latter area is concerned with 
teaching the computer to recognize patterns, usually letter characters; 
this research has proceeded under various names, including ‘‘neural-net 
simulation,” “self-organizing nets,’ ‘‘perceptron studies,” etc. The 
methods pursued can basically be characterized as shown in Fig. 8-27. 
In its simplest form, there is an input pattern senser, which could be, 
say, an array of photocells upon which the image of a letter is projected, 
each photocell being turned on when the image is light in its vicinity. 
The pattern senser is connected to a matrix of elements, variously called 


“neurons,” “association units,” “connection units,” ete., which in turn 
There is a threshold 6 associ- 


ure connected to a series of output units. 
ated with each of the neurons, such that, if for a given pattern on the 
input senser more than 6 of the inputs to the neurons (from the senser 


{If E[S(1), ... , S(n), D(A), . . . , D(m)] is the Boolean function that embodies 


medical knowledge—see See. 8-5—then a symptom S(j) is called redundant if there 
oxists a k such that 


and S(j) is called trrelevant if it does not appear in any prime implicant of E—see 
Chap, 10, 
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photocells) are on, then the neuron is turned on. A neuron that is on 
bas a certain value V > 0 associated with it; then each output unit has 
a threshold @ such that, if the sum of the values of the on neurons con- 
nected to it is greater than @, then the output unit will go on. The 
thresholds and values are all adjustable, and the problem is to choose 
them in such a way, say, that, when an A is projected on the senser, 
only output unit 1 will go on; when a B is projected, only output unit 2 
will go on; and so forth. The system is said to have formed ae abstrac- 
tion of an A, or of a B, etc., if it can recognize all kinds of A’s, whether 


Connection neurons 


Input-pattern sensef with reinforcement 
capabilities 


Output units 





Fic. 8-27. Technique for teaching the computer to recognize abstractions. 


they be large, small, crooked, rotated, translated, etc., as distinguish 
from all kinds of B’s, and so forth. The system is taught the abstract 
by repeated applications of the following “reinforcement rules 7 7 A 
is projected on the senser; if output unit 1 is not on, the threshol 80 
on neurons connected to output unit 1 are lowered, and their valuew 
raised; if any other output unit is on, the thresholds of all on neuron# 
connected to it are raised as their values are lowered. After many repos 
titions of this process for the various forms of A, the system 1s similarly 


“eonditioned” for B with unit 2 as the associated output unit; and so fort 
ing different reinforcem 


There are many variations to this scheme, US 
including f 


rules, having more than one level of connection neurons, 
back connections between neurons, and so forth, 
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Almost all pattern-recognition methods that include learning use the 
basic ingredients of the two methods described: namely, selection through 
some form of characteristic matching, and value reinforcement guided by 
iterated sampling. The difference between the two methods is that the 
former is set up to determine the concepts or categories from a given set 
of characteristic specifications, whereas the latter is primarily devoted to 
determining a good set of characteristic specifications given the concepts 
or categories. 

However, none of these methods relate to the syntactical type of con- 
cept formation. This is because the syntactical type of abstraction 
embodies a building-block procedure in which previous results are uti- 
lized in a hierarchical and recursive system. Apparently, deductive 
inference cannot contend with such a system, and inductive inference is 
required. 

The use of inductive inference in the formulation of abstractions also 
involves two types of problems: first, that of determining which syntactical 
structures should be defined given the relationships among the entities of the 
system under consideration; and second, that of determining the relation- 
ships between already defined syntactical structures. The former problem 
is primarily concerned with the criteria of usefulness to be applied to the 
system; the latter is the problem that the four-year-old boy faces when 
his parents point out to him ‘‘houses,” “windows,” ‘‘roofs,”’ “chimneys,” 
“doors,” ete. In both cases, trial and error are required, guided by heu- 
ristie procedures that may be built into the human brain, or acquired, 
or both. The frontiers of current research efforts are in these areas, and 
further consideration of these interesting points is beyond the scope of 
our present treatment. 

Creativity. Creativity, whether it be in music or in art, in mathe- 
matics or in physics, does not occur in a vacuum, but rather is affected 
by historical precedent. It is perhaps as much a product of the age in 
which it arises as it is of the mind of the individual from whom it springs; 
although it embodies the most personal of efforts, it derives its sustenance 
from the society in which it develops. Creativity produces structure out 
of disorder, form out of chaos, but the structure and form must meet 
aesthetic requirements as well. Such attributes, among many others, 
must be encompassed by a computer program that is to achieve crea- 
tivity. In our discussion of creativity, we shall first present an example 
of musical composition, { to illustrate the nature of at least one approach 
to the problem, and then give a brief critique of the method utilized. 

Music consists of tone sequences and the rhythm in which they oceur— 
and both these aspects must be considered by a computer program that is 


+ The illustration is taken from L, A, Hiller, Jr., and L, M, Isaacson, ‘Experimental 
Musie,’’ McGraw-Hill Book Company, Ine., New York, 1059. 
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to “compose” music. In the illustration to be described, the underlying 
idea is to generate random notes and random tune-interval patterns, but 
to select from these only those that meet the specifications, or constraints, 
of certain rules of melody and harmony for the notes and of rhythm for 
the time-interval patterns. Thus the problem becomes one of formu- 
lating the rules of selection so that the resulting sequence of tones and 
rhythms will be a musical composition. We shall consider first some of 
the rules of melody and harmony used in the illustration, and then rules 
for the selection of the rhythms. 

In the historical development of musical composition, the rules of 
counterpoint were set forth by Palestrina, a pre-Bach composer of choral 
religious music; the study of musical composition almost always begins 
with a course in counterpoint. These rules are chosen as the selection 
constraints for melody and harmony (the latter for four voices). There 
are three types of rules: those for melody alone—i.e., the horizontal 
sequence of tones; those for harmony alone—i.e., the vertical combi- 
nation of tones that will occur simultaneously; and combined rules 
referencing both melody and harmony. It is not appropriate here to 
give more than just a few samples of each type of rule, although 16 rules 
in all were used. 

1. SAMPLES OF MELODIC RULES. The range from the lowest note to 
the highest note of a melodic line should be no more than an octave: ¢.f., 


oe 
6 oO =<" 
— 
A melodic skip of a major or minor seventh is forbidden: e.g., 


Examples of = — 
forbidden 6 So =< o- 
oe 





Examples of 
octave 
ranges 





























sevenths 


Any melodic skip, or movement of a minor third or more, must he 
followed by a tone repeat or by a step, i.e., a movement of a major 0 
minor second; a melodic step may be followed by a step or by a skip: @. 

















. ki St 
Examples of aL % = 
skip-step a= 7 Qa 
motions bod ae ee 
; Step Step 
Step 


No more than one successive repetition of a given note is permitted, 
2. SAMPLES OF HARMONIC RULES. Permitted intervals include uniso 
octaves, perfect fifths, and major and minor thirds and sixths, I 
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bidden intervals include maj i 
jor and minor seconds and seventh - 
mented fourths, and diminished fifths: e.g., as 


rms gd fg 3 
mess ob 42 


The perfect fourth is permitted only if it does not occur between an 
upper voice and the voice sounding the lowest note of a chord: e.g 


= — 


Permitted Forbidden 















































3. SAMPLES OF COMBINED RULES. Parallel unisons, parallel perfect 
fifths and fourths, and parallel octaves are forbidden: e.g. 


Forbidden 


Unisons Octaves Fifths Fourths 


In proceeding from one chord to the next, at least one of the four voices 
must move by a step motion or remain stationary: e.g., 


Required 
step 6 
motion 


The entire collection of rules required that the processing be accomplished 
in three stages, for the initial notes, the intermediate notes, and the final 
or cadence, notes. The process progresses by repeatedly generating a 
random note and checking it with the rules, recording it if it is per- 
mitted and rejecting it if it is forbidden. When there are only a limited 
number of possible next notes, a random number determines which is 
chosen. The number of notes as well as voices in a melodic line is pre- 
determined, and from this the computer program recognizes the end of 
each such line, for which it is to utilize the cadence rules. 
































To accomplish the introduction of rhythm into the melody, a 2 
8 


four eighth notes to the measure—was used, If no shorter than an eighth 
note is utilized, then a four-bit binary number will represent all the 16 
rhythmic patterns available for a single measure (see Table 8-11). Thus 


meter— 
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TaBLe 8-11. Bastc Ruyrumic SCHEME FOR 8 METER 

Binary number Rhythm 
0000 
0001 
0010 
OO11 
0100 
O101 


0110 


[232 
O111 
1000 
1001 
1010 
1011 
1100 
1101 
1110 
1111 


random numbers could be utilized to generate rhythms. 








common characteristic of music is that there exist correlations of rhyth 
between voices in a polyphonic piece of music. For four voices, 7 
i d: 1111 meant that all four voices woul 
four-bit number was use Die oe 
the same rhythm for the measure, etc. The numbers 0001, on Ol 
were used to signify that the pairs of voices 1-2 and 3-4, 1-3 and 2-4, 


the same rhythm for that measure, 0101 that voices 


1-4 and 2-3, respectively, would have the same rhythms. 


[Cuap. 8 


However, 


In addit 
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another number, from 0 to 15, was used to determine for how many 
measures a particular rhythmic pattern would be repeated. All three of 
these numbers were chosen randomly. 

There are of course many further details to the work, including the 
use of probability distributions for selecting the various random notes 
and numbers, and so forth. In any event, we have tried to give a general 
idea of the approach taken in one instance of writing a computer program 
for the creation of a musical composition. The question arises as to 
whether this is really creativity. Although the experiment is exceed- 
ingly interesting, in the author’s opinion there is a lack of some of the 
necessary ingredients of creativity :} over-all planning or direction is miss- 
ing, leading to a sense of incompleteness; the resulting music is digressive; 
it lacks symmetry and the recursive building up of ideas. 

Two problems may be distinguished—that of imparting form, direction, 
and unity to a particular musical composition, and that of comprehend- 
ing an over-all, across-the-board characterization of a ‘‘style,”’ i.e., an 
abstraction that is recognizable in collections of compositions of a single 
composer. It is well known, for example, that the change in character 
of a composer’s works can often be directly related to the influences of 
other composers, teachers, etc., with whom he associated or studied. 
Can a computer be programmed to grasp such abstractions in a usable 
form? Cana computer program be written that will ‘mix’ abstractions, 
with the generation of new concepts? These are some of the questions 
that concern modern programming research. The answers will probably 
involve the notion of syntactical concept formation, giving the computer 
the ability to comprehend musical abstractions and to use such abstrac- 
tions as a guide to creativity. 


8-9. Exercises 


8-1. Draw a detailed flow chart of a rummy card game. 

8-2. Define a collection of two-address list-processing instructions sufficient for 
programming the flow chart of Exercise 8-1. 

8-3. Write a code using the instructions of Exercise 8-2 for the manipulation aspects 
of the flow chart of Exercise 8-1. Omit detailed consideration of any strategy 
considerations. 

8-4. If a one-address symbolic system of list-processing instructions were to 
he defined, how would the multiaddress aspects of a list-processing instruction 
be handled? (Hint: Use push-down lists.) 


Apply the translator to the following ALGOL statements using the syntax given in 
Table 8-4: 

8-5. A:=((B4+C) XD)+E 

8-6. A:=(BTCT(D+E) XB+C/(D+C)) XE 


| Based on hearing excerpts from the “Tlliac Suite,’’ composed by the Iliac com- 
puter at the University of Ulinois using Hiller and Isaaeson’s program, 
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8-7. How can numbers be incorporated into the syntax of Table 8-4? 
8-8. Using the extended syntax table of Exercise 8-7, apply the translator to 


A:z—B+CT2—4XxA XC 


TaBLE 8-12. ConpITIONAL AND ToTaL PROBABILITIES 


Symptom complex superscript 7 


Apply the translator to the following ALGOL statements using the syntax given in 
Table 8-5: 

8-9. A:=((B+C) XD) +E 

8-10. A:=(BTCT(D+E) XB+C/(D+C)) XE 

8-11. A:=(D+C/(BXATBT(C+D))) XE 


4 | P(d;) 


a YW, Yj Yip Yi) 
Boe —_ o 













8-12. Compare the target-language codes produced in Exercises 8-5 and 8-6 with 
those produced in Exercises 8-9 and 8-10, and explain why the differences occur. 

8-13. Write a syntax table for the translation of a simple Boolean expression of 
ALGOL as the source language to a one-address symbolic machine language as the 
target language. (Define instructions in the target language as needed.) 

8-14. Using the syntax table generated in Exercise 8-13, apply the translator to 


A:=BA(CVD)DE=F 








oo Ze 1.000 Hp = Vy, 00 
A 6 Eee = 0.200 | 0.200 me 
GIVI GY,» os 


P(s‘|d;) 








Disease complex subscript 


8-15. Suppose food F; contains 2 units/lb of vitamin A, 5 units /lb of vitamin B, 
and costs $1 per pound; food F, contains 4 units /lb of vitamin A, 2 units /Ib of vitamin 
B, and costs 80 cents per pound; and food F; contains 3 units/lb of vitamin A, 5 
ante of vitamin B, and costs $1.50 per pound. What diet of these three foods will 
contain at least 80 units of vitamin A and 60 units of vitamin B and be of minimum — 
cost? 

8-16. Carry out the actual computations that resulted in Fig. 8-14. 














8-26. Using deductive inference, show that the picture on the right of Fig. 8-26a 


is a house. 
8-17. Solve the following game-theoretical problem (reproduced from J. D. Williams, 9-97. Using dnduetive i ; 
-27. t . a: O68 
“The Compleat Strategyst,”’ McGraw-Hill Book Company, Inc., New York, 1954): ‘aa hi oe g inductive inference, show that the picture on the right of Fig. 8-26a 


8-28. Compose a set of syntactical definitions for a toy railroad train. 

8-29. Combine the definitions of Exercise 8-28 with those of the definition of a 
house given in the text, and try examples of distinguishing a house from a railroad 
train by deductive inference. 


8-30. Construct a set of operators for the concept of a toy railroad train, where the 


concept is 


8-31. Combine the operators of Exercise 8-30 with those given in the text for the 


concept of a house, and try examples of distinguishing a house from a railroad train 
by inductive inference. 


Frank is hurrying home late, after a particularly grueling day, when it pops 
into his mind that today is Kitty’s birthday! Or is it? Everything is closed except 
the florist’s. If it is not her birthday and he brings no gift, the situation will be 
neutral, i.e., value 0. If it is not and he comes in bursting with roses, and obviously 
confused, he may be subjected to the Martini test, but he will emerge in a position of 
strong one- -upness—which has value 1. If it is her birthday and he has, clearly, 
remembered it, that is worth somewhat more, say, 1.5. If he has forgotten it, he in 
down like a stone, say, —10. What should Frank do? 


























Using the conditional probabilities of Table 8-12 in conjunction with Fig. 8-16, 
make the differential diagnosis for patients presenting the following symptoms: 

8-18. S(1) - S(2). 

8-19. S(1) - (2) - S(3). 

8-20. S(1) - S(2) - S(3). 

8-21. S(1) - S(2) - S(8). 


8-22. Perform the calculations that lead to Fig. 8-19. 





8-32. Use the rules of musical composition as given in the text, and compose a 
musical composition by throwing dice. 
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PART 3 


DATA-PROCESSING TECHNIQUES 


CHAPTER 9 


FUNDAMENTALS OF NUMERICAL ANALYSIS 


_ Numerical analysis is a large and growing subject in itself, and our 
discussion must be limited to those elementary aspects which are con- 
sidered of importance to the computer programmer. These questions 
naturally arise in the study of digital computers: If a computer can 
do little more than the elementary arithmetic operations, how can it be 
used to perform function evaluation, integration, or differentiation, or to 
solve ordinary and partial differential equations? Also, in performing 
long sequences of arithmetic operations, how is the accuracy of the com- 
puted results analyzed and controlled? We can give in this chapter no 
more than a brief introduction to the answers to these questions, but it is 
felt that such an elementary survey can well serve to orient the student 
in this field. 

The chapter starts with a technique for solving linear simultaneous 
equations (Sec. 9-1), which arise frequently in all fields of science and 
engineering. Next, solution of algebraic and transcendental equations 
by means of successive approximations is considered (Sec. 9-2). Then 
the following three sections introduce the fundamental concept of poly- 
nomial approximations for the evaluation of functions. In passing, 
continued fractions are considered, so that the reader will not think 
there are no other methods for evaluating functions. The _ best-fit 
polynomial approximations are emphasized because of their extreme 
importance in digital-computer work. In order that the reader may 
develop a “numerical feeling” for such approximations, graphs are given 
of actually computed examples. The next five sections show how the 
name fundamental idea of polynomial approximation is used in numer- 
ical integration and differentiation and in the solution of differential 

equations. To complete the picture, the method of undetermined coeffi- 
cients is given, so that the student can experiment and derive formulas 
for himself. 

Up to this point discussion of error was omitted to allow full concen- 
(ration on the concepts involved; the final section of this chapter is 
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concerned with accuracy and error. Our discussion will be limited to 
fundamental definitions and to consideration of the arithmetic operations. 
In performing a computation on a digital computer, the significance and 


accuracy associated with arithmetic operations are usually uppermost in- 


the mind of the computer user. Hence an appreciation of the problems 
involved becomes of prime importance to the computer programmer. 


9-1. Simultaneous Linear Equations 


The Problem. Consider a set of n simultaneous linear equations in n 
unknowns 21, ..- 5 Xn? 


A101 + Aire tose H+ Aintn = Y1 
Qoit1 + Age tists H Antn = Y2 
AniL1 + An2w2 + a + AnnXn = Yn 


These equations admit of a unique nontrivial set of solutions, 
u= xi(dis, wee te sAnny YI; oe ot Yn) 


if at least one y; is not zero, and. if the determinant det (a;;) # 0.t I 

order to describe a method for solving the equations, let us first consid 

a special set of equations with coefficients s;, = 1 if 7 = k and sp = 
ifj > k: 

X1 + S1te + 8132s + Sigty = Y1 

Lo + So3t3 + Soaks = Y2 

3 + S34 = Yo 

T= Y4 


These four equations can easily be solved for the unknowns 41, X2, # 
and a, for x4 = y, is already given, and it follows that 


3 = Ys — S34v4 
Xe. = Yo — S244 — S23vs 
X1 = Yr — S1a¥4 — Sigl3 — Sire 


Hence it becomes clear that, if we can somehow manipulate Kqs. (1 
(2), . . . , (n) so that the diagonal coefficients become | and the coe 
cients below the diagonal become zero, then the solutions can be rapid 
obtained. 

The Method. This process can be accomplished as follows: Vi 
divide both sides of Eq. (1) by au. If au = 9, then rearrange the 0 


+ See, for example, B. A. Guillemin, “The Mathematics of Cireuit Analysis,” 
13-18, John Wiley & Sons, Inc., New York, 1951. 
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of the equations so that a1. # 0. (This, of course, c 
: an always be done. 
Why?) This results in J 
ay2 a 
l-2 eR, ates “in ne bi 
1 + ay Xe + + ay vn au (1)! 


Multiply Eq. (1)! by aa, and subtract the result from (2), obtaining 


O- ar + dove + °° + + Dantn = yr (2)! 
Aya 
where boo = xg — — bots ee ees G2;41n 
ay Qy4 
Gor 
and y2 = yo. — Gay 
Qi 


Similarly, multiply (1)! by a3:, and subtract the result from (3) 
obtaining 





O +21 + dgete + °° + + Dantn = ya’? (3)! 
where bo = Az. — 3012 Dan = Agn — Bsi18is 
au 7 11 
and (ss uss 
YR = he 
: y aii 
Continuing in this, we obtain a new set of Eqs. (1)', (2)1, ... , (n)! 


such that bi, = 1 and bj; = 0 forz > 1. Of course, it is clear that our 
new set of equations has the same solution as the original set. 

Next we start with (2)! and divide both sides by bes. [If necessary, a 
rearrangement of the order of Eqs. (2)!,... , (m)' can be made i 
ensure that bes ¥ 0.) We obtain 


bo: a) 
Least past + 5a = B (2)2 
22 22 








Multiply (2)! by bs2, and subtract the result from (3)!, obtaining 





Ove + egst3 + tt A Can%n = Ys (3)2 
bseb 
where ny re eege Le e ss _ Dsrban 
3 boo C3n Dan bag 
and thie y,Oice pie 


Similarly, we obtain 
Cagrs UF 5 ol ak t+ Cankn = ys (4)? 


und the same for (5)*, . .. , (n)*. In this step we have made cs; = 1 
and eg = O ford > 2, 
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Continuing in this way, we finally obtain, for the example of n = 4, the 


following equations, in the desired form for easy solution: 





diz 13 ai4 _ Vi 
ap + — a3; + — U4 = 7 
eet ai a an au an 
b yo 
23 24 2 
at p m= ZT 
is Pala ae bas 
(2) 
C34 Y3 
t3 + — 24 = —— 
C33 C33 
a ys 
{= 
d4a 


Example. Let us carry this process through for a specific set of four 
equations. 
321+ 92+ Gs — 1224 = 9 (a Pawar | 
Dry + Ben + TBs — Br = @) 
3c, + 1222+ 2les— 9x4 = 15 (3) 
—¢,— tet 8est+ 8a, =5 (4) 


Dividing both sides of (1) by 3, we find 


a, + Bao + 2x; — 4a, = 3 


Now multiply Eq. (1)! by 2, and subtract from Eq. (2), obtaining 


Qro + 8a; + Grr = —4 


Similarly, we multiply Eq. (1) by 3 and subtract from Eq. (3), obtaini 


3x0 + 15a; + 3824 = 6 


And finally (4)1 is obtained by multiplying (1)' by —1 and subtracti 


from (4), 
Qao + 1023 + 404 = 8 


Continuing, we form Eqs. (2)’, (3)2, and (4)’, 
ve + 4x3 + 3x4 = —2 


3x; — 6a, = 12 
Qr, — 2x4 12 


ll 


Finally we have 
a; — 2a, =4 
224 =4 
and wy = 2 
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(1)! 


Sec. 9-2] FUNDAMENTALS ‘OF NUMERICAL ANALYSIS 387 
Thus the set of equations from which the solution is obtained is 
xy + 322 + 223 = 40.4 = 3 (1)! 
v2 + 423 + 3X4 = —2 (2)? 
23 — 2a, =4 (8)8 
Gh, = 2 (4)4 
whence t= 2 =2 
w=4+2X2 = 8 
t= —-2-3X2-4xX8 = —40 
wm =3+4X2—2X8 —3 X (—40) = 115 















Figure 9-1 is the flow diagram for this method of solving simultaneous 
linear equations. 










Form equation (j)/ as the result of 
dividing equation (7) 9- 
by the coefficient of x; 


Letj+1—j 


(2)! 


Set k=j+1 


Multiply equation (7 by the 
coefficient of x; of 
equation (2)%-» and subtract 
this result from equation (k)’~?) 
to form equation (k)’ 


(3) 











Lets—l—s 












Letk+1—+k 


IA 


(4) 






Stop the 
computer 


(2)* 
(3)! 
(4) 


Fia. 9-1. Flow chart for simultaneous linear equations. 
9-2. Algebraic and Transcendental Equations 


Successive Approximations. The method of successive approximations 
can be used to evaluate the solutions of algebraic and transcendental 
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and so we try next 0.687, etc. In each step we find in which half of the 
previous interval the solution lies; Fig. 9-3 shows these calculations. The 
fact that we are halving each time means that we are developing the solu- 
tion as a binary number. If the solution lies in the right half of an inter- 
val in a step, then a unit corresponds to this step in the solution; other- 
wise a zero corresponds. Hence the solution x = .10101111 according to 
Fig. 9-3, column yp. 

The flow diagram (see Fig. 9-4) is relatively simple: depending on 
whether sin x; < 2 — 2x; orsin a; > 2 — 22;, we determine the next trial 
solution x;;1 by adding to or subtracting from the previous trial solution 2; 


Although the best method for making the successive oe 
mations depends in any case on the nature of the functions involve , 7 
shall try by means of specific examples to illustrate the a nee 7 
that apply. Often a graph of the functions involved in th e us 10 
will help in the determination of the best method of approximation. 


Consider, for example, the equation 


equations. 


sin z + 2a —2 = 0 


We can write this as sin x = 2 — 24 and graph 


aor nae The desired value of x is the 


yr = sin cand yp = 2 — 2x (see Fig. 9-2). 

















y ‘ : yr YP 

2 @) ah RE ES De ee ee greater | greater 
(1) 0.500 0.479 1 Vv 
(2) 0.750 0.682 0.5 Vv 
(3) 0.625 0.585 0.75 Vv 
(4) 0.687 0.634 0.626 Vv 
(5) 0.656 0.609 0.688 Vv 
(6) 0.672 0.621 0.656 Vv 
(7) 0.680 0.629 0.640 Vv 

1 (8) 0.684 0.631 0.632 Vv 





Fie. 9-3. Calculations for solving the equation sin x = 2 — 2z. 





one-half of the previously used increment Az; Since each trial deter- 
mines an additional bit of the solution, the procedure is repeated the 
same number of times as the number of bits of accuracy desired in the 
solution. 

Accelerating the Convergence of Iterative Processes.{ The problem of 
solving an equation F(z) = 0 can be stated in general as that of finding 
i value (root) of x, say x,, such that F(z,) = 0. The general zteration 
algorithm involves successive approximations 2», %1, %, . . . , which are 
determined as follows: | 

1. If F(z) = 0 can be expressed in the form x = f(x), then we let | 
Inui = f(an). For some initial guess xo, we find x; = f(xo), x2 = f(x1), 
wy = f(w2), ... , continuing until, for some n, |r, — 2n41| is less than 
some preassigned small number, the allowable error. 

2. If F(x) = 0. cannot be written in the form of x = f(x), then since as 
n> ©, F(a) > 0 and any — an > 0, we let any1 = Xn + OF (an), where | 
# is some suitably chosen nonzero constant, and proceed as in case 1. 





x 
X, «—- XgXuXq X2 1 














Fic. 9-2. Finding solution to sin x + 24 —2 =0 by the method of successive 


proximations. , 
i he intersection of the two curves, where sin & Wl 
a: oh We know that the value of « must be ie wean oa 
hence we try x: such that 0 < a1 < 1. As can be seen ene i } 
if sin x; > 2 — 21, then the solution must lie to the left o ru 7 
x < «1; if sin 2, <2 — 2x, then the solution must be to the ir 0 2 
that is, v > 21. Suppose that we try 21 = 0.500 (radians, 0 a 
then sin x, = 0.479, and 2 — 2x, = 1. Hence 0.500 < a < er: pe 
try «2. = 0.750 (that is, half the distance between 0.500 “ee >i a 
sin rz. = 0.682 > 2 — 2x2 = 0.5, whence 0.500 < x < oT 7 e | 
a3 = 0.625 (half the distance between 0.500 and 0.750); then 
























































{ This process was worked out by J. H. Wegstein and P. Henrici; see J. H. Wegstein, 
Accelerating Convergence of Iterative Processes, Communs, ACM, vol. 1, no. 6, 
pp. M13, June, 1058, 








sin a; = 0.585 < 2 — 2x, = 0,75 
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Let us consider case 1, for what we shall say also holds for case 2; i.e., some value between x, and 2,41. For instance, in the case illustrated 
Fig. 9-5 describes the process. Since x = f(x), the object is to find the in Fig. 9-5, q should be chosen so that ; 
intersection of the two curves y = & and y = f(z). From y = f(an) we a 
me Seem oes 
1—q AC 












Of course we do not know the ratio BC/AC, but we can determine it 
approximately. For PC = BC, and thus 





Set x;= 27" 
and Ax;= 27? 
Set n equal to the number of 

bits of accuracy in the 
solution 


Form sin x; 













(XngtrFn41)) 
(X%n421%n+2) 


Put 1 in the 27th position of solution 


Ax, 
AX 511 = > (the interval) 


Put 0 in the 2-‘th position of solution 


Ax; . 
AXj41 “> (the interval) 















(Xn41,%n41) 


(Xn f(%n)) 





Hip Xj AX L1 Xp =X tAXiy1 








Fig. 9-5. The iterative process. 


Add 1 to tally 


ot 









from the calculus). From the definition of the derivative, m can be iI 


Stop computer approximated by 


f(%n) ane f(&n—1) — tn — Ln 


m= 












































































































































Fic. 9-4. Flow chart for solving the equation sin x = 2 — 2a. Big ad a a | 

Hence we can take 
find y,, point A; from y = 2 we find Yn = n+, point B; from y = f(# q _m I! 
we find yny1; from y = a we find Yny1 = Tn+25 and so forth. Pago md ) 
| 








The iteration process just described can be accelerated, however, 
observing from Fig. 9-5 that instead of taking 241 for the substitut 
into f(x), we would do better to choose a value «*,, for the substitut 
where 
























































The iterative process can then be described by the flow chart of Fig. 9-6, 
wherein we must observe carefully when «* or 2, is used. 
As-an example, consider the equation 


2%, = gtn + (1 — g)tn+1 am by(ew — oo) 





392 DATA-PROCESSING TECHNIQUES [CHaP. 9 





Xai = Oat (Ll —Q)ingt 





Let x*_,=x% 
xA=xXh41 
Is the solution 
sufficiently accurate ? 


Fia. 9-6. Accelerating the convergence of the iterative process. 


TaBLE 9-1. EXAMPLE oF Boru THE ORDINARY AND ACCELERATED ITERATIVE 
Processes FoR THE Roor x = 0 or x = 14(e7-5* — e°-5*) 








an by ordinary zn by accelerating | Corresponding 
" | iterative substitution | iterative substitution | value of m 
0 1.000 1.000 
1 —0.521 —0.521 
2 0.263 —0.00348 0.340 
3 —0.132 —1.32 X 1075 0.335 
4 0.066 —1.65 X 107! 0.333 
5 —0.033 
6 0.017 
7 —0.008 
8 0.004 














Fra. 9-7. Comparison of ordinary and accelerated (dashed line) iterative processes 
the « = Oroot of « = }g(e7%-5* — e%-5), 
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With a = —0.5, let us solve for the root « = 0, using both the ordinary 
and the accelerated iterative processes. Starting with zr» = 1, we have 
Table 9-1 and Fig. 9-7. 

The ordinary iterative process may result in four types of behavior. 
The successive values of z, can (1) oscillate and converge, (2) oscillate 
and diverge, (3) converge monotonically, or (4) diverge monotonically. 
The accelerating method just presented will make the nonconverging 
types 2 and 4 converge and will accelerate the convergence of types 1 and 
3. We have illustrated the situation for type 1. Exercises 9-9 to 9-11 
will illustrate types 2 to 4. 


9-3. Function Evaluation : Series and Continued Fractions 


Series Approximations. In the previous section we have tacitly 
assumed that the values of the functions entering into the equations 
could somehow be obtained by the computer for any given value of x. 
lor polynomials no problem arises, since the evaluation of a polynomial 
is easily coded. On the other hand, how are the values for sin x to be 
computed for a particular 2? We shall discuss in this and the next two 
sections three ways in which such functions may be evaluated by the 
computer. These are (1) by power-series approximations, (2) by interpo- 
lation in a table, (3) by means of “‘best-fit”? polynomial approximations. 
Of course our primary purpose is to discuss functions that are not them- 
selves polynomials or fractions of polynomials, since the methods apply 
trivially for polynomials. 

Many functions can be easily expanded in power series, and then the 
functions can be evaluated by means of these series for values of x within 
the region of convergence of the series. For example, 


: + nr ne 
snz—-%—- ate at 


Then if sin 0.5 were required, the computer would be coded to compute 


0.53 | 0.55 0.57 

OOS Ser gk yal 

If it were desired to know sin 0.5 to within +0.0001, then the series 

would be carried out until some term were less than 0.0001. For exam- 

ple, sin 0.56 = 0.5 — 0.02083333 + 0.00026042 — 0.00000155, where we 

stop. Hence, to the desired accuracy, sin 0.5 = 0.4794. Another way 
of interpreting this result is that the polynomial approximation 





+ ::: 


zt, x 


in* 
sin*a=a2—-y+5 
Bli; Bf 


in sufficiently accurate for our purposes when « = 0.5. We write sin* x 
instead of sin w to indicate that this is just an approaimation to sin aw. 
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On the other hand, some series do not converge quite so rapidly—for 
example, 
ph i ah ae! 


arctang =2—-—Zte oat coe 
For « = 0.5 we find 
arctan* 0.5 = 0.5 — 0.041666 + 0.006250 — 0.001116 


which obviously is not accurate to +0.0001. 
Often other expansions of a function can be found that might converge 
more rapidly than the power series; e.g., for arctan © we have the series 


2 a 2:4 a! | 





3i¢a+3-304m)* 


Continued-fraction Approximations. There are other expansions 
functions that often aid in their evaluation: for example, continu 
fractions such as 


x 
arctan x = al + 


arctan « = - 
x 


Le 


2 
3 4 4a 


9x? 


162? 
Be sera 


a+ 


It can be shownt that a continued fraction can be easily evaluated 
means of 2 X 2 matrices as follows: Given the continued fraction 
b 
Pp =a, + mmeace: t 
Seca 3 
4 
a rae 
i Speman. ae 
G4 ats: 
the nth convergent Pn/qn is the result obtained when all terms from n + 
on are neglected; e.g., the fourth convergent is 


PA ay + Bs 


b 
as a2 + 3 


bs 
ae 





The nth convergent may be evaluated from the following matrix equatio 


Pn Pn-1 = ai 1 a2 :) (¢ ) ae (s ,) 
G. Ge) 1° Obs 07 XB, 0 b, 0 


+ See L. M. Milne-Thomson, ‘The Calculus of Finite Differences,”’ p, 108, 
Macmillan Company, New York, 1951, / 
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From this we can find p, and gn, and also pr_-1 and qn—1. For example, 
for arctan 0.5 we have 


(2 2) =G a) Gs 0) Gas 0) o) Gas o) 
qs gs) \1 0/\05 0/\0.25 0/\1 0/\2.25 0 
_ ( 59.3750 8.00 
~ (128.0625 ea 
Hence 
ps _ 59.3750 a 8.00 
mo 10 | ras 
Thus arctan 0.5 = 0.4636, to within +0.0001 (since the difference 
between the fourth and fifth convergents was 0.0001). We see that the 
infinite-fraction expansion of arctan 2 converged faster than its power- 
series expansion. 

There are of course many other kinds of expansions of functions— 
for instance, Fourier-series expansions. However, the evaluation of a 
function by means of an expansion is not always the preferred method. 
Hence other means are often resorted to, and we shall consider a few of 
them briefly in the following sections. 

Differences of a Polynomial. Since, as we shall see below, the poly- 
nomial approximation plays a central role in numerical analysis, it is 
important first to consider some very interesting properties of the poly- 
nomials themselves. These properties involve the so-called ‘differences 
of a polynomial.’”? Consider a polynomial y(x) whose values have been 
found at equal intervals Az; = h: that is, yo = y(xo), yi = y(ao + h), 
yo = y(to + 2h), .... The differences of successive values of y, 
namely, 


3 


= 0.4637 


| 





Ayi = Yiti — Ys = y(tigr) — y(@) 


ure called the first differences of the polynomial. The differences of suc- 
cessive first differences are called second differences: 


A?y; = Ayinr — AY: = Yin2e — QYin1 + Yi 
Similarly we can define third differences, fourth differences, and so on: 


Ay; = Ayn — Aye = Yiss — 8Yi¢2 + 8YH1 — Yi 
Aty; = A®yin1 — A®ys = Yess — 4Yses + Bye — 4yig1 + Yi 


In general, the jth difference is given by 


Aly, = Yiti — (7) Yits—1 + (3) Yijw-2 — °° + (12% 


- d, (-1" (2) was 
he 
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An interesting observation about such finite differences of a polynomial 
is that if the polynomial is of degree n (that is, n is the highest exponent 
of z with nonzero coefficient) then A"y; = k, a constant. For example, 
consider the polynomial 


y = 2? —x+1 
The first and second differences are tabulated in Table 9-2, in which we 


TABLE 9-2. DIFFERENCES OF A POLYNOMIAL OF DEGREE 2 


—— 





x y Ayi Ay; 
0 1 
1 
1 2 4 
5 
2 7 4 
9 
3 16 4 
13 
4 29 4 
17 
5 46 














have written the differences between the corresponding lines of the tab 
To see that the A2y; are constant, consider the general equation of degree 


y = ax? + bu +e 
Then 


a(a +h)? + b(a +h) +e 
ax? + (2ah + b)a + (ah? + bh + ¢) 


y + Ay 


On subtracting the former equation from the latter we obtain 


Ay = 2ahx + (ah? + bh) 

Again, 
Ay + A(Ay) = 2ah(x + h) + (ah? + bh) = 2ahx + (Bah? + bh) 
On subtracting, we find A(Ay) = A’y = 2ah?, which is the desired 
stant. The same procedure for a general polynomial of the nth deg 
namely, 
y = at" + bar ee ee 

yields 

Avy = a[n(n — 1)(n— 2) +++ I= ahn! 
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Thus, if the values of a polynomial of degree n are tabulated at equal 
intervals, any error could be detected by computing all the Ay; and 
checking for constancy. 

Error Propagation of Differences. Let y; (¢@ = 0, 1,2, ... , 10) be 11 
correct values of the function y(x), not necessarily a polynomial, for equal 
intervals of x. Suppose, however, that in the table of differences, Table 
9-3, ys was in error and y; + ¢ had been recorded. The table shows how 
this error is propagated. Thus, for example, 


Ys te — ys = (Ys — Ys) +e = Ays + € 
yo Yet) = Yo — ys) —e = Ays — € 
Ays + € — Ays (Ays — Ays) +e = A’ys; + € 
(Ays — «) == (Ays + €) = (Ays — Aya) —2e= A’y4 — 2e 


I 


and so forth. Note that the coefficients of the e’s are just the binomial 
coefficients with alternating signs and that the algebraic sum of the errors in 
each column is zero. The maximum error in the differences is in the same 
horizontal line as the error in the function table. Thus, by taking a suf- 
ficient number of differences, an error in the table of a function can be 
identified. However, in practice it rarely pays to take more than the 
fourth differences, for beyond this the round-off errors themselves begin 
to accumulate. 


TABLE 9-3. PROPAGATION OF ERRORS IN DIFFERENCES 










































y Ay Aty A’y Aty 
Yo 
Ayo 
y1 A*yo 
Ayn A*yy 
i Atys A‘yo = 
Aye Ary, nee 
Ya A2ys Be ne Aces 
Ays Last eg Fr Ay, + € 
ne A*ys + € Aty, — 4e 
Aya + cr aera Aty; — de onccerrerm 
Ys te ------- < A’yy — 2e Y—---—~—-- «! Aty; + 6e ‘pe 
Ays-¢€ “77-77 ~~ Ady, + Be SHH 
Yeo | Rese. A’y; te Aly, — 4e 
Aye Te cae A®y;, — € 
yn A*y6 A Ss 2 A'y; +e 
Ayr Asy6 ieee 
ys APyz A'ys See 
Ays Asy; 
Vo A*ys 
Alo 
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9-4. Function Evaluation : Interpolation 


Polynomial Interpolation. Until now we have been considering fune- 
tions that are memorized in the computer as subroutines. For instance, 
if y = f(z), then for a given 1 the computer will determine f (a1) by means 
of a program, which might be based on some expansion of f(x). On the 
other hand, it is often impossible to write a program for a certain function: 
e.g., the function might be empirical data, or perhaps the subroutine 
might require too much time or too much memory space. In these and 
other instances it is often convenient to record values of f(x) in the 
computer memory in the form of a function table which gives the f(a) 
corresponding to a set of specific values of x;. It appears at first glance 
that this method has great limitations, since only those values of f(a) 
which correspond to a relatively few values of 2, are in the memory of th 
computer. However, by means of interpolation, values of f(x) corre 
sponding to an x not listed in the table can be approximated. 

Suppose that it is desired to find the value of f(x) for a value of zn 
listed in the table. It may happen that there are listed in the tab 
some values of f(a) for x; near x. Then, in order to find the value of f(z) 
it seems plausible to draw a smooth curve, y = f*(x), through all t 
points listed in the table and to say that y = f*(x) as read from t 
curve is an approximation to y = f(x) (see Fig. 9-8). Such a process 
called interpolation and is most often accomplished by means of a pol 
nomial approximation. 

If n + 1 points (2o,yo), (1,41), (a2,Yo)) - + +» (nyYn) Of f(x) are list 
in the table, then a unique polynomial of the nth degree, f*(a), can 
derived that passes through all these n + 1 distinct points. For let 


y = f*(2) = Aot+ Art + Asa? + oes f+ Ana 


If f*(xo) = yo, f*(t1) = Yr - + + > f* (an) = Yn, then these are n + 
equations which can be solved for Ao, Ai, .-- An. Once these val 
have been obtained, we can find for any x’ ~ x, the value of f*(2’). 
more points considered, i.e., the higher the degree of /*(«), the clo 
will be the value of f*(2’) to f(x’). Let f*,(a) represent the polynom 
of first degree that goes through the points (2o,Yo) and (2x1,y1); let Je ya6 
represent the polynomial of second degree that goes through the poitt 
(x0,yo), (t1,y1), and (x2,Y2); ete. Suppose that we want the interpolath 
f*(a’) to within an accuracy of 0.001. To obtain this, we would caleulay 
successively f*,(2’), f¥1,2(2’), f*s0,3(2'), ete., until 


\fo.1 sys ,-1(2") — Prnur(2)| < 0,001 


and then we would take fj,,,,,,(@) as our result, 
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Usually, the more known points used in an interpolation, the better 
becomes the accuracy of the interpolation. However, if the additional 
points are far from the point of interpolation, they may actually make 
the approximation worse. Similarly, with the same number of given 
points, the smaller the interval in which they are chosen, the more 
accurate the polynomial approximation in that interval. To illustrate 


are tan x 








0.4 
(Differences 
0.3 slightly 
exaggerated) 


1.0 


Via. 9-8. A third-degree polynomial drawn throu i 
gh the four points (—1, —0.785 
( —0.2, —0.197), (+0.2,+0.197), (+1, +0.785), as an approximation to arctan x, 


shown by the solid line. Here f*(z) = arctan* x = - 
ee eee ice f*(z) = arctan* x = 0.9952 — 0.21023. The dashed 


this latter point, in Fig. 9-9 we have drawn the error curves for two poly- 
nomial approximations to arctan x. Each error curve has x as abscissa 
and the difference between the actual value of arctan x and its polynomial 
approximation as the ordinate. The first approximation is based on the 
nix points —1, —0.6, —0.2, +0.2, +0.6, and 1; the second approxima- 
tion, on the six points 0, 0.2, 0.4, 0.6, 0.8, and 1. Figure 9-9 illustrates 
the error curves for 0 < «<1. Note that in this interval the maxi- 


mum error in the former case is over forty times the maximum error in 
the latter case, 
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Obviously the sequence of calculations for a polynomial interpolation 
would be quite lengthy for each desired value if all the coefficients had 
to be calculated first, and then f*(x’). The process can be shortened by 
calculating f*(2’) directly, at essentially the same time as the coeff-cients 
are calculated. That is, the coefficients are never explicitly given, but 


error 
0.0025 


0.0020 


0.0015 


0.0010 


0.0005 





06 0.7 08 O09 41.0 


—0.0005 


Fic. 9-9. Comparison of the error curves for the interpolation polynomial of arctan* 
based on the six points —1, —0.6, —0.2, +0.2, +0.6, +1 (the curve with the bi 
hump) and based on the six points 0, 0.2, 0.4, 0.6, 0.8, 1 (the curve that hugs the 
axis). These error curves indicate just how much the interpolation polynomials di 
from the true value of arctan z. 4 


f*(a’) results directly. It should be remembered, however, that t 
meaning of f*(2’) is the value at x’ of the approximating polynomift 
that goes through the specified points. 

Aitken’s Method. One method for shortening calculations, call 
Aitken’s method, follows.f First note that 


Yo Lo — & 


ok Y¥ewmi—7e 
Z =>-—lll 
S016 ) x1 ato 


} See W. E. Milne, “Numerical Calculus,” pp. 68-72, Princeton University 
Princeton, N.J., 1949, 
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Xo— 2}. : ; 
where : a 7s the determinant with value yo(a1 — x) — yi(ao — 2). 
1 Ly 

For 

A a 

oe Yr L1 — Xo _ Yolt1 — Xo) _ 
Fialao) = HE =H 
Yo Lo — X11 
0 

and So:(a1) = ee =y 


Since ff, (x) goes through the points (xo,yo) and (21,1) and is of the first 
degree, it is the required (unique) polynomial. Hence, for a given num- 
ber 2’, to find ff ,(2’) we just calculate Eq. (a) with this number 2’ sub- 
stituted for the symbol z. 

; The polynomial that goes through (20,yo), (a1,y1), and (a2,y2) is given 
xy 

f 01(2) %1— 2% 
fo.2() ve & 

G2 —- XH 











fo1,2(%) = 


where ff(x) is the polynomial that goes through (xo,yo) and (22,y2). 
For f%1,2,3(@) we have similarly 








fort) t2-—2 
ft.3(2) 2 —2 
fio g(x) = Lens 
* aa(t) C) 


The process can be extended in this way. For example, 


| f0:1,2,3,4,5(2) L5 — x 
J0.1,2,3,4,6(%) ve — x 
Us — v5 








F0:1,2,3,4,5,6(2) = 


Ilence the process can be systematized, with f*(x) being recomputed, 
including more and more points, until the desired accuracy is obtained 
(assuming that there are enough points in the table). The following 
scheme can be used for performing the calculations, the terms in each row 
being determined in order, until f*(z) at the end of a row is of the desired 
nccuracy : 


%o, Yo (to — 2) 


a NS oN 

Va, Ve ta 2 0,900 0,1,200 

Xs (ty x) Se y(x) to ‘ (x) to rh (x) 

tu ye (ta — 2S") — Bio fal 
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9-5. Function Evaluation: Best-fit Polynomial Approximation 


Polynomial Approximations. As we have noted in Sec. 9-3, the evalu- 
ation of f(z) by means of terms up to the x" term of its power-series 
expansion can be considered as approximating f(x) by a polynomial of 
degree n, where the coefficients are given by Taylor’s formula. Similarly 
the method of interpolation can also be interpreted as the approximation 
of f(x) by a polynomial, even though the polynomial may never be 
explicitly determined by the method given; the coefficients of the interpo- 
lation polynomial of degree n are determined from n + 1 known values 
of f(x). For arctan x we have the power-series approximation 


arctan* « = 1.0000002 — 0.33333323 + 0.2000002° 


For the interpolation approximation (for the points —1, —0.6, —0.2, 
+0.2, +0.6, and 1 given in Sec. 9-4) we would have 


arctan* « = 0.999105a — 0.306897x? + 0.0931902° 


In order to show how good these approximations are, in Fig. 9-10 havi 
been drawn the error functions 


e(v) = approximation — function = aretan* « — arctan x 


for the range 0 < x < 1, for both cases. 

Best-fit Polynomials. The question immediately arises: Does the 
exist another fifth-degree polynomial that is a “better fit” to arctan 
for0 <a <1? We say that ff (x) is a better fit to f(a) than f¥ (x) for 
given range of « if within that range max lex(x)| < max |eo(x)|. In gem 
eral the answer to the above question is “yes.” or arctan x the best-fi 
polynomial for 0 < x < 1 is given by 


arctan* « = 0.9953542 — 0.288679a* + 0.0793312° 


Figure 9-10 also shows (x) for this approximation. How one arriv 
at such best-fit approximations is beyond the scope of this book. t 

The advantage of using a best-fit approximation over a Taylor-seri 
approximation or interpolation is easily seen: fewer computations 
required to obtain the same accuracy than for a Taylor series; and 1) 
tables need be memorized in the computer, as is necessary for inter 
lation. Hence time and space are saved in the computation. Althou 


+ For such a discussion see Cecil Hastings, Jr., “Approximations for Digital Co 
puters,” Princeton University Press, Princeton, N.J., 1955; and H, H, Den 
Computer Generation of Optimized Subroutines, J. Assoc, Computing Mach. 
8, pp. 104-116, January, 1961. 
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€(x) 
for interpolation 
polynomial 


e(x) 
for best-fit 
polynomial 





€(x) 
—0.0010 for power-series 
polynomial 
€(x) 


I'1q. 9-10. A comparison, by means of error curves, of three fifth-degree polynomial 
approximations to arctan x for0 <2 <1. 
1. Power-series approximation 
arctan* « = 1.000000z — 0.33333323 + 0.20000025 
2. Interpolation approximation on the points —1, —0.6, —0.2, +0.2, +0.6, and 1: 
arctan* x = 0.9991052 — 0.306897x* + 0.093190z° 
3. Best-fit approximation: 


arctan* x = 0.995354a — 0.288679x3 + 0.07933125 


we have illustrated a best-fit polynomial that is good to within + 0.0005, 
best-fit, polynomials can be formed that will give any desired accuracy. 
lor example, a best-fit polynomial for arctan a, for 0 < x < 1, which is 
good to within +0.0000 01, is given by 


aretan” « = 0.9999 7726 a — 0.3326 2347 a + 0.1935 4346 25 
— 0.1164 3287 27 + 0.0526 5332 2° — 0.0117 2120 a2 
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Best-fit approximations need not be of the forms shown for arctan 2 
but should be of some relatively simple algebraic form. For example, 
for0 < x < o, a best-fit approximation for 


oP) “2 
fas. i edt 
Vr Jo 
is given by 
1 
(1 — ayx + aoa? + azx ® + agrt)4 


ial) ee 





where a; = 0.278393, a2 = 0.230389, a3 = 0.000972, and a, = 0.07810 





Zz 
Fia. 9-11. Error curve for f(x) = wall e* dt. 


vx Jo 


In this case ¢(x) is given in Fig. 9-11, plotted against ~/z. 
easily coded for a computer. 


This f*(x) 


9-6. Integration 


By Definition. In this section we shall see how a computer can in 
grate. In his previous studies the student has been taught to integr 
by means of the manipulation of abstract symbols. However, a lit 
reflection will show that it is really numbers being handled, these n 
bers being represented for convenience by symbols. Of course, & 
puter cannot directly manipulate symbols in the ordinary sense, b 
can handle numbers. As we shall see, the simplest method of hand 
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numbers to obtain an integral is to follow the very definition of the 
meaning of integration. 

Recall now the definition of integration (see Fig. 9-12). The integral 
of f(x) from x = 0 to x = p, written ii f(x) dz, is the area between the 
curve y = f(x) and the z axis, bounded by x = O anda = p. It is found 
by considering the rectangles formed 
by partitioning the x axis between 0 
and p into n intervals each of the 
same length h, where h = (p — 0)/n. 
If there are n intervals each of length 
h, the area of the 7th rectangle will 
be hy; and the sum of the areas of 
all the rectangles will be 


oe > 


In another partition that has more 
intervals the length of each will be 0 
less. Hence, as the number of inter- 
vals becomes infinite, the length of 







\ 







\ 





V/; 









ral 


et N 


YY 





SSSSSSSSS QW) 


RS 


SSE... |WDN 
SSS A.A_A_A_ AN 


” DSL LL 





WM) 





lA, 





Fra. 9-12. Numerical integration. 


cach approaches 0. The integral ie . y dx is defined as the limit of the 


sum of the areas of the rectangles as the number of intervals increases 
and the length of each approaches zero, 


‘lo integrate on a computer, we could just form > hy;. Then we might 
i=l 
2n 


take a smaller interval, say h/2, and form ) (h/2)y:. If 


t=1 


2n n 
> >, hw 
t=] t=1 


is less than the allowable error desired for this integration, then the 
integration is completed. If not, we could try intervals of h/4, and 
wo on, This process naturally assumes that we have a subroutine to 
evaluate yy @ f(a) for each a under consideration, 
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By the Trapezoid Rule. Although the above method is a good way 
for evaluating an integral, there are methods that may require less com: 
putation. Suppose that we considered the trapezoids formed by adding 
little triangles to the rectangles, as shown in Fig. 9-138. Here the area 
of the trapezoid is h[yo + le(yi — yo)| = (h/2)(yo + y:). For n trape= 
zoids we have A = (h/2)(yo + 2y1 + 2y2 + °° * + Yn). Certainly this 
will be a better initial approximation to the area under the curve than wa 
A=hA(tyity2t+ °° * +4yn). Hence, using this method, we shall not 
need to go to as small intervals to obtain the desired accuracy as we did 
using the previous, cruder method. The method given in this paragraph 
is called the trapezoid rule. 


if 
%2 

Fia. 9-13. Trapezoidal approximation to 
integration. 


y 





/ 


L) 


X9=0 x, =h x2=2h 


The parabola 
y* 


Fie. 9-14. Parabolic approximation 
integration. 


By Polynomial Approximations. An even better initial approximatit 
can be obtained by considering three points at a time instead of t 
Suppose that we consider the area under the parabola y* = a + bx + ¢ 
that passes through three points, as shown in Fig. 9-14. Then 


2h 2 3 ]2h 
[ (a+ be tex!) de = or + + S| 
0 0 


= 2h-a+ 2h?-b + 8h? +c 


Ao,2 


ll 


To determine the coefficients a, b, and c, we note that 


If x = 0, a+ 0°'b+ O'c=¥Yy 
If zx =h, at h:'b+ Weec=y 
If « = 2h, a+ 2h:b+4h?:c= yp 


Src. 9-7] 


These, when solved for a, b, and c, yield 
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b = Yet 4y1 — 3Yo ¢ = U2 = 241 + Yo 
2h 2h2 


bos = 5 (yo + du + 1) 


a= Yo 


whence 


Similarly Ao4 = (h/3)(y2 + 4ys + ys), and so forth, whence, if n is even, 
we have 


A= (yo + dyn + Que + dys + Que t+ dys + rte sb Ayn + Yn) 


This is known as Simpson’s rule. 

Similarly we could try to make even better initial approximations to 
the area under the curve. Once the method of approximation is decided 
upon, more accurate results can be obtained only by increasing n, the 
number of intervals—as can be seen from the original definition of the 
integral. 

We have been assuming until now that the function y(x) can be 
evaluated at any point x by a subroutine. Suppose, on the other hand, 
that the function f(x) were given by a table of points, spaced at equal 
intervals 2; Then the above procedures would of course still hold. 
But these methods will evaluate integrals only between the points that 
ure given in the table, and not other points. However, one might choose 
simply to integrate the approximating polynomial (determined as in the 
previous sections) and then to evaluate this function between any two 
points, whether they are or are not given in the table. 


9-7. Differentiation 


By Definition. The derivative of y = f(x) with respect to a, taken at 
v = 2p, is defined by 


dy\ 
Ot Jamey 


and represents the slope of the tangent drawn to the curve y = f(x) at zo. 


Hence, if f(a) is given at x = a and 2 = 2, an approximation of the 
derivative might be 


Az—0 Ax 





f(s) — f@) 


V1 — Xo 


Unfortunately, with computers it is not always feasible to take 2, suc- 
cessively closer to x) and evaluate the fraction each time, for x1 — xo 
becomes very small, as does f(7,) — f(a), and often for a very small 
v, = xo the loss of significant figures throws the calculation far off, 
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By Polynomial Approximation. Suppose that f(x) is given at Xo = 0, 
a, = h,and 2, = 2h. Analogous to the process described for integration, 
we might determine as an approximation the derivative of the parabola 
that goes through these three points. Let y = a + bx + ca?, where a, b, 
and c are as in Sec. 9-6. Hence 

dy =y2 + 4yr = 3yo 4 Y2 = 2ys F Yo , 


Bae Viale ee = ah he 





Then for the derivatives taken at % = %, = %, and x = &, respe 
tively, we have 
dy 1 
asi ee ee (oe 4 ean 
(2) oh (—3yo + 4y1 — y2) 


y=Yo 


dy 
(#). = 5h ( Yo+ Yy2) 


y=" 


dy 1 d 

= == — 4 BY 

(#) oh (Yo yi + Bye) 
y=Y2 

The formulas are equally valid for any three points (x0,yo), (@1,y1), & 

(x2,y2) equally spaced on the x axis, since translating the curve so th 


y =f*(x) 






if 
y= fla), 


Fic. 9-15. Approximating the derivative from an oscillating polynomial. 
to = 0 does not affect its slope.t If the function is given by means of 
table, then these rules can be used for evaluating derivatives at poit 


given in the table. 
Of course there are always the approximating polynomials of f(x), W 
can easily be differentiated, the resulting derivative being evaluated 


the requisite points. 


+ See, for example, Milne, op. cit., p. 96. 
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However, numerical differentiation at best is often quite inaccurate. 
For these methods must be based on approximating polynomials that 
depend on values of the function at a few specifie points. There is no 
practical way for the computer to tell how the approximating polynomial 
oscillates around the given function. Hence the slope of the polynomial 
at a point may be entirely different from the slope of the function, even 
though the curves lie very close, or even touch (see Fig. 9-15). On the 
other hand, the value of the area under the curve is seldom so sensitive 
to these oscillations. 


9-8. Undetermined Coefficients 


The Method. As was shown in the previous two sections, formulas 
for the integral or derivative of a function can be obtained by integrating 
or differentiating the polynomial approximation functions that pass 
through some given points. However, in those sections formulas were 
specifically derived only where three points were fitted. Formulas 
based on more points can be derived by the method of undetermined 
coefficients,t which follows. We shall develop the method by means of 
illustrations. 

Suppose that we wish to rederive the formula of Sec. 9-6, Simpson’s 


rule, and find an approximation of , . y dz based on the polynomial 
y = f*(x) of the second degree passing through the three points (2o,¥o), 
(1,y1), (#2,4Y2) with equally spaced abscissas. Since the value of an 
integral does not change if it is translated along the x axis, it will prove 
helpful to take advantage of symmetry, by letting the x coordinates of 
these points be v7» = —h, «1 = 0, % = +h. Now let 


Aoyo + Airy: + Aryo = [Ryde 


where Ao, Ai, and A» are the as yet undetermined coefficients. This 
formula must be exact (i.e., not an approximation) for y = f(x), when 


f(x) is any polynomial of degree 2 or less, because there is a uniquely 


determined polynomial of degree not more than n that passes through 
n +1 points. Hence this formula must be true for the functions 
y = 1, y=2, and y = 2. Substituting these three functions succes- 
sively in the equation, we find 


| 
bo 
> 


Ag+ A, + A, = 
—hAo +0: Ait hAz = 
h?Ago + 0° Ai + h2Ag = 26h3 


| 
oO 


{ See ibid, p, 104, 
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Solving for Ao, A1, and Ae, we find 
Ao = Ao = Yh and Ay = 46h 
x2 h 
y dx = 3 (yo + dyn + ye) 


zo 


Hence 











as desired. 
Suppose that we now desire to find a formula for (dy/dz) z=», based on 


four points with equally spaced abscissas, (a0,yo), (€1,Y1)5 (a2,y2), and 
(ta,ys). Let to = —h, 1 = 0, v2 = h, x3 = 2h, and 


d 
Boyo + Bry + Boyz + Bays = (24) 


r=2, 


where Bo, Bi, Bz, and B; are the undetermined coefficients. This formula 
must be exact when y is any polynomial of degree not more than 3, and 
hence for the polynomials y = 1, y = 2, y = x, and y = x. Substitut- 
ing these successively into the equation, we find 


Bo + Bit Bet B,=0 
h?Bo + 0 5) By + h?Be + 4h?B; 7 0 
—hBo + 0 . B, + heBs + Sh3B3 = 0 
These, when solved for Bo, Bi, Bo, and Bs, give 
1 1 1 1 
Bo = — 3) Bi= — 5), er ee 3 and Bs = — & 
d 1 
whence (H) = Fr, (—2yo — 3y1 + 6y2 — ys) 


9-9. Differential Equations 


Meaning of Solution. A differential equation describes how a syst 
goes from a known point to the next infinitesimally close point. 
instance, the well-known equation dv/dt = F/m describes the relation 
of the velocity v of the mass m to the time 1, when a force F is appli 
Given a point (f0,v0), the equation says that at an infinitesimally shi 
time later, say at t:, the velocity will be v1 = (F/m) (ti — to) + 
Such equations arise because in nature relationships between physi 
quantities often reduce to linear functions at the infinitesimal level. 

Consider, for example, a differential equation of the first order in t 


form 


The solution is to be determined as a function of x only, that is, y = ¢ 
Of course y = g(x) can be represented as a curve, and the differen 
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equation tells us how to draw this curve. 


(ao,yo), must be known or given. 
that 


One point on the curve, say 
Then the differential equation says 


d 
(#4) = f(x0,Yo) 


oe hence we know the slope at (xo,yo). A linear approximation to 
e next point might be obtained (see Fig. 9-16) by continuing along 


y 





x 





Xo x) x2 x3 


Fie. 9-16. Linear approximation fro 
¢ ; m Fra. 9-17. Polynomial approximati 
point to point along a curve. from point to point along eae nnn 


the tangent line, i.e., by choosing an 2 and determining y,; from 


Yt = Yo + (41 — 2) (7) 


r=Z9 


Using (21Y1) we find that (d dx =r, — 1 
) y/ )e Ly i 
, f(x Ya) and then, choosing an 


Yo2= + (22 — X1) (#4) 


and in this manner (2x3,y3), . . . are determined and the function y = g(x) 
is approximated. Of course this is just an approximation, even if we 
make 1 — Lo, 2 — Ly, Tz — X . . . small, for the differential equation 
taken literally implies that x: — xo, %2 — 2, . . . should be infinites- 
imal. What we are essentially saying by this method is that a straight 
line through (x,yo) is a good approximation to g(x) near (xo,yo) and 
therefore the next point (x1,y1) ought to lie very close to it. 

By Polynomial Approximation. Consider again a first-order differen- 
tial equation dy/dx = f(x,y). A better approximation to y = g(x) might 
be obtained if we started with two points, (ao,yo) and (21,41). Then we 
could use a parabola as the approximating curve of g(a) heat (xo,yo) and 
(v1,y1), to determine (x9,¥,) (see Fig. 9-17), Take, for exaniple, the 


tad 
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second formula for the parabolic approximation of Sec. 9-7, 


dy me oe 
(HY) = on | Yo + Ye) 


2 d 
From this Yo = Yo + 2h (#4) 
Hence if we take 11 = %o +h, x2 = x) + 2h, etc., and if we know (20,y0) 
and (#1,y1), then we can determine (dy/dz),-:, = f(%1,yi), and from this 
ye. Our equation from Sec. 9-7 can be written in general as 


d 
Ynti ae Yn-1 + 2h (4) 


Thence we find y3 for 73 = x9 + 8h, using (dy/dx),-» = f(x2,y2), and so 
forth. This process is clear, provided that initially we know (a1,y:) 
as well as (a0,yo). Given (20,yo), we can find (x1,y1) by forming a Taylor- 
series expansion around (2 ,yo). Since we already know dy/dx = f(x,y), 
then 
2 3. 2 
tS ay) T¥= Fo flow) ote. 


for the coefficients, and hence (x1,y1) is easily found, 


d = 0)? (a? 
Yi = Yo + (a1 = Xo) (4) af fae (G2) _ 


(x — x)3 as 
a 3! (2) 


where the first few terms are usually sufficient. 

Similarly, higher-degree approximation polynomials can be used to 
estimate successive points. For example, a cubic might be used that 
is based on 


d d 
(Xo,Yo) (1,41) (#4) and (#4) 


Such formulas can be derived by means of the method of undetermined 
coefficients. 
Example. As an example, consider the differential equation 


dy _ 
Fg a 


Suppose that, at x) = 0, yo = 0. Choose a; = 0.1, a = 0.2, xy = 0,8, 
etc. Then, by our first, linear method, 
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Y1 = Yo + (1 — Xo) (2). = yo + (1 — 2) + yo”) 
=04 (0.1-—0)1+4+0) =0.1 
yo = yi + (a2 — 11) (#4) = 91 + (@ — m)(1 + yr’) 
= 0.1+ 0.1 X 1.01 = 0.201 
Ys = Y2 + (a3 — 2X2) (#4). = yo + (a3 — ae)(1 + yo?) = 0.8050 


and so forth. 
Next consider the quadratic method. We must first find y, for 
v, = 0.1. Taking the first three terms of the Taylor expansion, 


= 2 
Yr = Yo + (1 — o)(1 + yo?) + (= ad 2yo(1 + Yo?) 


3 
4 as ar (2(1 yor)? + A4yo2(1 + Yo)| 
0+ 0.1 + 0 + 0.0003 


= 0.1003 
Then 


1 
n= 02 (4H) + yo = 0.2 X (1+ yr) + yo 
= 0.2 X 1.0101 + 0 = 0.2020 
dy 


n= 0.2 (4) +y=02X 04+ y%) +y1 
: = 0.2 X 1.0408 + 0.1003 = 0.3085 


and so forth. The exact solution to dy/dx =1+y? is y = tana. 
With greater accuracy, we have yo = tan0 = 0, y; = tan 0.1 = 0.1021, 
Y2 = tan 0.2 = 0.2035, and y3 = tan 0.3 = 0.312. 

Since in the numerical solution of a differential equation each point 
depends on previous points, errors are cumulative. Hence the only way 
to get a better solution is to take the interval A smaller for a particular 
method of approximation. 

By Runge’s Method. There are formulas for the integration of differ- 
ential equations other than those which can be derived directly by 
the method of undetermined coefficients. One well-known example is 
Runge’s method. This has the advantages that it has very small error 
and that it requires no special methods to start the solution. It is, 
however, based on Simpson’s rule given in Sec. 9-6 above, which for 
y = F(x) can be written 


woth h 
Ao(P) = / F(x) dx = rf 


vo 





F(x) + a (2 + 5) + F(x + m)| 
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If we have y'(x) = dy/dx = f(x,y), then 


ylao + h) = ylae) + Ay = yoo) + f2" ye) de = y(oo) + Acaly’) 


That is, Ay = Ao,a(y’); or, applying Simpson’s rule, 
h 
Ay = g Yo + 4y1 + ¥) 
Clearly y, = f(xo,yo) can be computed. The approximation 
h h 
ne st(a + 7 yo + + vt) 
is reasonable since it involves only h/2. However, 


f(ao +h, yo + yoh) = me 


is too crude an approximation for ys, and hence Runge’s method uses 
Yo = f(xo + h, yo + meh). Summarizing, then, the method uses these 
formulas: 


Mo = f(Xo,Yo) 
Mm, = f(z yo + m5) 


mo = f(x + hyo + moh) 
ms = f(ao + hyyo + meh) 


from which Ay = ; (my + 4m + ms) 


Figure 9-18 gives the geometric interpretation of the different slopes 
Mo, M1, M2, and ms used. 


9-10. Partial Differential Equations 


Obtaining numerical solutions to partial differential equations can be 
a difficult task. Ideally the nature and properties of the solutions should 
be known beforehand. This knowledge can be obtained through mathe- 
matical analytic investigations and also from the behavior of the physical 
situation that the equation describes. The uniqueness of the solution 
must be assured, as well as the continuity of the solution in the region 
under consideration. There are many methods for solving partial differ- 
ential equations numerically, some of which depend strongly on the nature 
of the particular equation under consideration. In this short section we 
can do no more than discuss some of the concepts upon which these 
numerical methods are often based, a more advanced treatment be 
beyond the scope of this book. 
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Yotmeh 








h 
rots 


Fia. 9-18. Illustration of slopes mo, mi, m2, and m; used in Runge’s method. 


Difference Quotients. A difference quotient is the finite approximation 
to a derivative; it is the quotient whose limiting value is the derivative. 
For example, if f = f(x), 


. h 
is the difference quotient corresponding to df/dz (see Fig. 9-19a). Simi- 
larly the “backward” difference quotient 


j, = f@=se—W 


could be formed. For functions of two variables, difference quotients 
corresponding to partial derivatives can be formed. For example, if 
u = u(a,y), then corresponding to du/dx we have the difference quotients 


— u(a +h, y) — ulx,y) 
ty h 


AL u(y) = wa —hy) 
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and corresponding to du/dy we can form 
a, — Wey +h) = ule) 








¥ h 
_ ulz,y) — ulz, y — A) 
Ug h 












These can be visualized in Fig. 9-19b. Second difference quotients, 


x-2h x-h x xth x+2h x 
(a) One-dimensional array 









19) (x+h,y) 





a oo 











x 
(b) Two-dimensional array 


Fia. 9-19. Lattice arrays for visualizing difference quotients. 


corresponding to second partial derivatives, are difference quotients of 
the first difference quotients. Thus we have 








Ue — Ue uUlath,y) — 2u(a,y) + ule —h,y) 
ar aid he 

Uy — Uz — u(x, y +h) — 2u(z,y) + ula, y — h) 
ES he 


Analogous difference quotients can be written for functions of more than 
two variables and visualized in lattice arrays of more than two dimensions, 
The solution to partial differential equations is cast in the form of 
operations with the dzfference equations formed by replacing the partial 
derivatives by their corresponding difference quotients. For example, 
consider the equation (known as Laplace’s equation) 
atu | ou 
dx? ' ay? 
The corresponding difference equation becomes 


u(x +h, y) — 2u(x,y) + u(x — h, y) 
h?2 





+ way + oe ule y =) Oe 
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If we solve for u(x,y) the ‘‘meaning”’ of the equation becomes apparent, 
for 


u(x,y) = Y4lu(@ +h, y) + ua — hy) + u(a,y +h) + ulz,y—h)] (3) 


Thus the equation “says” that the value of u at any point is the average 
of the values of u at the four adjacent lattice-array points. As a physical 
example of such a situation, consider a square of metal as in Fig. 9-20. 
Suppose that the temperatures u at points along the edge of the square 
are fixed and that the temperature of the rest of the square has become 
stationary, i.e., is no longer changing with the time; then Laplace’s 
equation (1) holds. That is, according to (3), the temperature at any 
point in the square is nearly the average of the temperatures at the four 





Fia. 9-20. Temperatures on a metal square to illustrate Laplace’s equation. 


adjacent points—this last remark holding only when h is sufficiently 
small, of course! 

Thus the solution to (1) is found by determining the values of the 
function uw at all lattice points so that (3) is satisfied. Then a smaller 
interval A is chosen, and (3) is satisfied again for this finer array of lattice 
points, and so forth, until the values at common points determined in 
two successive steps do not change appreciably. Summarizing, the 
numerical solution to the partial differential equation is the array of 
final values of the function at the lattice points as determined by both 
the given boundary values and the equation itself. For example, in 
lig. 9-21 the vertical axis represents the temperature value of our square 
as computed for the boundary temperature values indicated by the 
dashed curves. Clearly, therefore, the values comprising the solution 
need be no more accurate than the given values at the boundary. 
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By Iteration of Difference Equations. The method of solution of par- 
tial differential equations by iteration is best described by means of an 
example. We shall therefore consider the Laplace equation (1) with the 
boundary values of Fig. 9-21. The first step is to start with a crude, or 
coarse, lattice and make a guess at the solution, i.e., at the values of the 

















Fic. 9-21. Solution to Laplace’s equation for the temperature example, with bound@agy 
values as shown. 


function at these lattice points. For example, Fig. 9-22 illustrates i) 
coarse lattice with appropriate guesses (top numbers in each list) at the 
lattice points. In this case it is reasonable to take as an initial guest 


the weighted average of the four associated boundary points as the value 


of an interior point. For instance, for v1 we formed 


Quro + 201 + Wis + Us = 3.4 
411," 0 aeulne ee, ee Oe Oe . 








Src. 9-10] FUNDAMENTALS OF NUMERICAL ANALYSIS 419 


and so forth. Of course in a realistic situation all auxiliary information 
would be brought to bear on the initial guess and if possible a much finer 
lattice would be chosen. With the initial guess completed, the difference 
equation is utilized to successively and systematically adjust the values 
at each point by utilizing the values of the adjacent points. Each time 
a computation is made, the most recent 
values of the adjacent points are used. 
For our example we use Eq. (3) and 
determine successively wi, Wis, U1, and 
us. Thus 


Un = 14 (ur + U1o + Var + uo1) = 3.3 
Next wi. is computed from 
U2 = M4 (ui + Ui + U22 + uo1) = 5.9 


where the most recent value of wi 
enters the calculation, and so forth. 
All the points of the lattice are suc- 
cessively recomputed over and over Fic. 9-22. Initial values and first, 
again until they no longer change. RP HTAC SB WEE CORD UNaTICHI: 

The next step is to increase the number of points on the lattice, say by 
halving the interval h (see Fig. 9-23). Since new points appear, initial 
guesses must again be made. Here interpolations can be made for new 
points that lie between two old points; the other new points might be 
given the value of the average of adjacent points. We choose a simple 
linear interpolation and average for the illustration. The initial guesses 
having been made, the iteration proceeds row by row for all points, the 
entire process being repeated until the values of all points cease changing. 
A finer lattice can then be used, and so forth. The resulting function 
u(2,h) has been shown in three-dimensional form in Fig. 9-21. 

By Relaxation of Difference Equations. For each fineness of array 
lattice points considered in the numerical processes, our aim is to deter- 
mine the values of u;; so that the difference equation (associated with the 
partial differential equation) is satisfied at every lattice point. The pro- 
cedure is first to guess at initial values and then to keep systematically 
adjusting these until the equation holds at every point; then a finer array 
of lattice points is chosen and the procedure repeated. In the previous 
paragraph we considered an iterative method for adjusting the u;; values 
it any array stage; here we shall consider another method for performing 
the adjustment of w,; values, called the relaxation method for satisfying 
the difference equation, Again this method is best described by means 
of an example, As before let us choose Laplace’s equation with differ- 
ence equation (3) for the illustration, 


U30 u31 32 U33 
9.9 





0 3.3 5.6 7.1 
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The method of relaxation is concerned with the so-called residual R,; at 
each point, which is the discrepancy in the value of u at the point that 
accounts for the difference equation not holding. 
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6. 7.1 


Fig. 9-23. A finer lattice calculation than that of Fig. 9-22. The boundary valu 


were taken from Fig. 9-21. 
For our example, the difference equation (3) can be written as 
(wiper + Ui ja + Wins + Ue-1s) — 4; = 0 
[where we have replaced u(x + h, h) by wisp, U(x — hy y) by Wije 


u(x, y +h) by wr, and u(x, y — h) by w-1,5)- Thus the residual 
point (i,j) can be expressed as 


Riy = (Wij. + Wager 1 Wega, + Ui-1,3) — AU (4 


and the purpose of the relaxation method is to try to make the residu 
at, every point become equal to zero. This is accomplished by approp 
ately adjusting the value of u;. Two problems therefore remain for d 
cussion: first, by how much should wu; be adjusted, and second, after Uy 
adjusted, how are the residuals at neighboring points corrected to aceo 
for this adjustment? 
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If in our example we adjust the value of wu; by Auw,, then from (4) the 
residual R,; is altered by an amount AR;; given by 


AR,; = -—4 Aus; 


But we desire that the final residual be zero, i.e., that R, + AR, = 0 


, 
whence 


and we must therefore adjust u,; by an amount 
Auy = 4Riy 


which solves the first of the problems mentioned above. Next consider 
the new value of the residual of an adjacent point, say R,j41. This 
residual, which was originally 


Rii+1 = (Uii42 + ug + Ugg + Ui—1,341) — Ui,j41 


becomes now 


Rises + AR 541 = [ise + (uig + Avis) + vig + Wass] — Unga 
Hence 
AR i j41 = Au 


giving the needed change in residual of the adjacent point. Similarly for 
the other adjacent points, AR;,;-1, ARi+41,;, and AR,_1,; are all equal to Au;;. 

Now we can completely describe the method of relaxation as follows: 
First, compute the residual at each lattice point. Second, choose the 
point with the largest residual, and adjust the value of that point accord- 
ing to a formula derived from the associated difference equations so that 
the residual there becomes zero. Third, recalculate the residuals at the 
adjacent points to account for the new value of the adjusted point. Then 
the residuals are again scanned and the largest residual again chosen, and 
the process continued until all the residuals become zero at the same time. 
This process of relaxation is accomplished for each fineness of array lattice. 


9-11. Accuracy and Error 


Sources of Error. We can distinguish among three contributions to 
the error of a numerical computation: 

1. First is the error due to the approximate nature of the numerical 
analysis of the computations. As we have seen in previous sections, 
frequently the value of a function or the solution to an equation can 
theoretically be obtained only by performing an infinite sequence of 
iterations, Of course the infinite process cannot be completed, and so 
we must stop with some iteration in the sequence, accepting this as an 
adequate approximation to the desired results, The resulting error 
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is called a truncated error and often can be made as small as we please. 
The problem here then is to estimate the accuracy of the result after an itera- 
tion to determine whether or not it is as precise as required, 1.e., whether or 
not more iterations should be made. 

2. Second is the error due to the approximate nature of experimental 
data used in the calculation; this is called the experimental error. Here 
the problem is so to analyze the experimental data as to obtain the best possible 
estimate of the measurement involved, i.e., to determine the accuracy of the 
experimentally obtained numbers. T he accuracy of such numbers 
depends on aspects other than mathematical considerations, and therefore 
from a computational point of view these errors are treated as initial 
constraints on the problem. 

3. Third is the error due to the use of approximate numbers in perform- 
ing the numerical calculations; such errors are called round-off errors. 
Any number that we write is necessarily in finite decimal (or binary) 
form, which is only an approximation to its infinite decimal conception. f 
For example, might be approximated by 3.1415927. Numbers can be 
approximated to any degree of accuracy: e.g., a better approximation to 
x is 3.14159 26535 89793. Experimental numbers may also be con- 
sidered to be finite approximations of some (unknown) infinite decimal. 
Here of course we cannot get any better approximation than those deter- 
mined by methods associated with the above-discussed second source 0 
error. Thus the problem here is to perform the four arithmetic operations t 
a manner that retains the maximum desired accuracy under the circumstances 

Consideration of truncated errors forms a large subject in itself. Fo 
the various iterative formulas given in the previous sections, formul 
can be obtained for the errors, or for bounds on the errors, after an 
particular iteration. For example, the remainder formula for a Tay] 
series is familiar to the student as a measure of the error of approximati 
of a finite number of terms of a power series. Textbooks on numeri¢ 
analysis consider these formulas in detail; this subject is, howeve 
beyond the scope of the present brief treatment. Consideration 
experimental errors is also beyond the scope of this treatment. 

This section then considers the last of these three problems, the roun 
off errors. The importance of this subject, to numerical calculato 
cannot be overestimated. During a computation it is not an overstal 
ment to say that consideration of errors that may arise from arithmeti 
operations should be uppermost in the mind of the user of the comput 
The discussions of this section merely form the basis for such error ¢o 
siderations in a particular computation. Very often it is necessary f 
the user of the computer to determine experimentally the behavi 
of his numbers during the computations, by having the computer perlo 

+ Here we consider, for example, the number 0.5000 + + + to be an infinite decin 
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series of trial computations. These experimental computations can be 
used to determine the correct operational procedures. 

Significant Figures and Round Numbers. The accuracy of a number is 
measured in terms of the number of significant figures it contains. Any 
one of the digits 1, 2, 3, 4, 5, 6, 7, 8, 9 is a significant figure; 0 is some- 
times but not always a significant figure. To determine when 0 is a 
significant figure, we distinguish three cases: 

1. The zero lies to the left of all other nonzero digits of the number, 
e.g., as in 0.00123. Here the zero is used simply to fix the decimal point, 
and therefore it is not considered to be significant. None of the three 
zeros in 0.00123 is significant. 

2. The zero or zeros lie between nonzero digits, e.g., as in 3,002. Here 
the zeros are considered to be significant. 

3. The zero lies to the right of all other nonzero digits, e.g., 123,000. 
In this case it cannot be determined from the number itself whether or 
not the zero is significant; it must be determined from the context in 
which it was written. Often, if the zeros in 123,000 are not significant, we 
would write 123 X 103, while on the other hand if the first zero were 
significant we would write 1,230 x 10? to indicate this. 

For example, the number of significant figures in each of the following 
numbers is four: 1,234, 9,072, 5,006, 0.008379, 0.01024, 6,080 X 10°, 
7,000 X 10%. The leftmost significant figure in a number is called its 
most significant figure; in our examples the most significant figures are, 
respectively, 1, 9,5, 8,1,6,and7. The least significant figure is the right- 
most significant figure of a number; in our examples the least significant 
figures are, respectively, 4, 2, 6, 9, 4, 0, and 0. 

A number composed of n significant figures is said to be correct to n 
significant figures if its value is correct to within 14 unit in the least sig- 
nificant position. For example, if 9,072 is correct to four significant 
figures, then it is understood that the number lies between 9,072.5 and 
9,071.5 (that is, 9,072 + 0.5); if 0.01024 is correct to four significant 
figures, then it lies between 0.010245 and 0.010235; if 6,080 X 10° is 
correct to four significant figures, it lies between 6,080,500 and 6,079,500. 

Another term used to describe a number is the number of decimal places 
of the number, being the number of digits to the right of the decimal 
point. Thus 0.008379 has six decimal places, and 0.37 X 10-* has seven 
decimal places. 

To be used during a computation, numbers that are infinite decimals 
must be converted to finite-decimal approximations. Similarly numbers 
that contain more significant figures than are desirable for a particular 
computation must be approximated by numbers of fewer significant figures. 
The process of reducing the number of significant figures of a number is 
called rounding. Numbers that are rounded so that they contain 7 
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significant figures are formed so that they will be correct to n significant 
figures; they are then called round numbers. Rounding can be accom- 
plished by means of the following rule: To round a number so that it is 
correct to n significant figures, retain the n leftmost significant figures, and 
discard the rest; if the discarded number is greater than }4 unit in the 
least retained significant figure, then add 1 to this last retained figure; 
if the discarded number is less than 14 unit in the least retained significant 
figure, leave the latter unchanged. The case where the discarded number 
is exactly 14 unit in the least retained significant figure is often handled 
thus: If the least significant figure is even, leave it unchanged; if odd, 
then add 1 to it. By means of this latter rule the errors due to rounding 
will tend to cancel, since even and odd digits occur on the average with 
equal frequency. Also, by leaving the rounded number in this case as 
an even number we increase our chances that the result of a division 
might be exact. As examples, the following numbers have been rounded 
to four significant figures: 


Number Rounded number 
3.141|5926 3.142 
0.1428|57142 0.1429 
0.005329|601 0.005330 

421 .2|4899 421.2 
58 .76|5000 58.76 
93 .29!5000 93.30 


Relative Accuracy and Absolute Accuracy. The error in a number, 
compared with its true value, is stated in various ways. The absol 
error of an approximate number N* from its true value N is defined 


« = |N*— J 


The relative error e, is defined by 


The percentage error e, is defined by «, X 100 per cent. The relative- 
percentage-error concepts have the advantage that they are independe 
of the unit of measurement, whereas the absolute error is expressed 
terms of the unit used. For example, 3.14 is an approximate val 
of the infinite decimal 7; the absolute error of this approximation 
[3.14 — | < 0.002 (r — 3.14 = 0.00159 - - -); the relative error 
|3.14 — 1|/3.14 < 0.002/3.14 < 0.0006; the percentage error ¢, < 0, 
per cent. 

The accuracy of an approximate number can be considered from ¢ 
points of view. The first, and that most often considered in using di 
computers, is the concept of relative accuracy. In this case the aceur 
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is measured by the relative error and not the absolute error. As will be 
shown two paragraphs below, the number of significant figures of an 
approximate number is a measure of the relative error and hence of the 
relative accuracy of the number. It is for this reason that significant 
figures are so important. To see the importance of relative error as an 
index of the accuracy of a number, consider the measurement of the 
0.002-in. diameter of a human hair to within 1/100,000 in., as compared 
with the measurement of the 240,000-mile distance from the earth to the 
moon, to within 2 miles. The relative error of the measurement of the 
hair is 0.00001/0.002 = 1/200, and the relative error of the measure- 
ment of the distance to the moon is 2/240,000 = 1/120,000, while the 
absolute errors are 0.00001 and 126,720 in., respectively. Clearly the 
measurement of the distance to the moon is 600 times more accurate 
than that of the diameter of the human hair, even though the absolute 
error of the former is over 12 X 10° times that of the latter. 

On the other hand, the absolute accuracy of a measurement is often 
important from another point of view. For example, in the tolerance of 
machined parts it is the absolute error that counts as the measure of 
accuracy; hence the allowed absolute error is frequently stated explicitly 
by writing the desired sizes as finite decimals followed by + and the 
tolerance which is the maximum allowable absolute error. In this case 
it is the number of decimal places that plays the important role. 

The relation between the number of significant figures and the relative 
crror inherent in an approximate number is given by the following 
theorem, in which e, is the relative error, K is the most significant figure 
of the number, and n is a positive integer: 

If a given number is correct to n significant figures, then 


1 
ier ge 
** > OK xX 10"! 


When the most significant figure is not explicitly known, we can say that 
the relative error ¢, < 1/(2 X 10"~1), For any radix q the 10°-! in 
these formulas is replaced by g"-!. If g = 2 (binary), then the inequali- 
ties are read e, < 1/2”. 

To see why this theorem is true, note that N* > K xX 10?-! (for exam- 
ple, 521 > 500 = 5 X 102), where p locates the decimal point with respect 
to the most significant figure K.t Now since N* represents an approxi- 
mate number, correct to n significant figures, i.e., with an error of 14 unit 
in the last (nth) significant figure, then e < % x 10°-" and 


lg & 10>" 1 1 
tee PCTS en ee ae ee 
& SVR IN*® x lo” SOR xX 101 


{| For example, 8,000 would be written 8 X 10° = 3 x 10°, whence p = 4. On 
the other hand, 0,008 would be written 8 & 10°" « 3% 10°", whence p = —2, 
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If K is not known, then to ensure that the inequality will hold for every 
K we choose the smallest, namely, 1. 

This theorem is of important practical value in using a digital computer 
having floating-point arithmetic operations. It enables the evaluation 
of the number of significant figures that must be carried to ensure that 
the relative error be kept below a given amount; and conversely it 
enables the evaluation of the relative error for a specified number of 
significant figures. 

Significant Figures in Arithmetic Operations. Since numerical compu- 
tation as performed by a digital computer reduces to addition, sub- 
traction, multiplication, and division, the problem of retaining significance 
during these operations becomes fundamental. We shall here give rules 
of thumb commonly used by mathematicians regarding the control of 
significance during arithmetic operations. In any particular situation 
the rules serve merely as a guide; the details of the process usually 
depend on the particular numbers and errors involved and hence must 
often be carefully examined. In many situations it ts just as bad to 
overestimate an error resulting from a computation as it is to underestimate 
one. 

Consider addition first. The rule of thumb is that in adding two 
(positive) numbers there is usually no loss of significant figures; in fact 
the result sometimes has one more significant figure than either of the 
operands. 

For example, suppose that 52,761.2 and 71,436.7 were rounded 
52,761 and 71,437 and added: 





Rounded Unrounded 
52,761 52,761.2 
+ 71,437 + 71,436.7 
124,198 124,197.9 
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is of course an exception to our rule, chosen so that the errors in the least 
significant figures of the operands were maximum and were added 
together. It is clear that, if one always includes one more significant 
figure than is necessary, one will be more sure of having at least the 
required significance in one’s result. 

Subtraction, however, is another story. The rule is simply a warning 
that complete loss of significance is possible and that each case must be 
examined on its own merits. For example, suppose that we are forming 
the difference of 52,763.5 and 52,762.5 as rounded to 52,764 and 52,762: 








Rounded Unrounded 
52,764 52,763 .5 
—52, 762 —52,762.5 
00, 002 00,001 .0 


Although the subtrahend and the minuend were correct to five significant 
figures, the difference has no significant figures; i.e., the 2 is off by a full 
unit. Most generally there is not a complete loss of significance, but 
very frequently severe loss of significance occurs. There are two reme- 
dies that can be tried. The first is to use many more significant figures 
in the subtrahend and minuend to begin with and hope that enough 
significance will result for the remainder. The other is to try to trans- 
form the expression to something else and thereby avoid the subtraction 
altogether. For example, if one had to evaluate 1 — cos x for small a, 
it might be wiser to write 1 — cos x = 2 sin? (a/2) and evaluate the 
latter expression. 

For multiplication the rule is that up to two significant figures can be 
lost, though not necessarily. This same rule applies to division. Con- 
sider, for example, the product of 811.2 and 112.5, each rounded to three 
significant figures: 


















































The rounded numbers are both correct to five significant figures, and t 
result is correct to six, as is seen on comparison with the addition of t 
unrounded numbers. Here there was a gain of significance, becau 
the rounding errors canceled out, and the added significant figure resul 
from the carry. However, consider 12,761.5 and 11,435.5 rounded 
12,762 and 11,436 and added: 





Rounded Unrounded 
12,762 12,761.5 
+11,486 +11,485.5 
24,198 24,197 .0 


Here the result is correct only to four significant figures since, as can 
seen from the unrounded addition, the rightmost 8 is off by 1 unit, 


Rounded Unrounded 
811 811.2 
112 112.5 

1622 40560 
811 16224 
811 8112 
90832 8112 
91260 .00 


The third figure of the product is off by 4 units and therefore is not 
significant; the second figure from the left is, however, off by only 0.4 unit 
and therefore is significant. The product is thus correct to two sig- 
nee figures, while the multiplicand and multiplier were each correct 
to three, 
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As an example of the loss of a significant figure during division, con- 
sider 1,761.5 and 1,763.4, rounded to 1,762 and 1,763: 





Rounded Unrounded 
1.0005 1.001 
1,762|1, 763 .0000 17 615/17, 634.000 
1,762 17,615.000 
1.0000 19.000 


Here the rounded dividend and divisor are correct to four significant 
figures, but the quotient is correct only to three. 

One further rule might be helpful. In digital computers, where it is 
in general no more difficult to handle numbers with many significant 
figures, keep as many significant figures as possible during the computa- 
tions, and beware of loss of significance through subtraction. 

Pitfalls. A person using an electronic computer always worries about 
whether the results he gets are or are not accurate. For usually only the 
final results of a computation are read out from the computer’s memory, 
and intermediate results are neither retained nor observed. The results 
of computer programs are often checked by comparing a few values with 
some previously hand-computed values. However, the fact that these 
few isolated results check is no guarantee that all the results computed 
by the program will be equally correct. Rarely is every difficulty fully 
anticipated. Automatic programming routines aggravate this situation, 
increasing the possibility of overlooking pitfalls. Hence, when writing 
his program the programmer must plan against the loss of too many 
significant figures in the computation, as well as choose the computation 
method that can best result in the necessary accuracy. During th 
running of the program on the computer the programmer must be acutel 
aware of such pitfalls as overflows, loss of significance, difficulties arisin 
in intermediate forms, and other, unanticipated, singular situations. 

For example, consider a pitfall that occurs in evaluating ~/x. If 
x = 0.0000 3259, where x is good to seven decimal places and uncertain 
in the last (i.e., the 9), then to eight decimal places we get 


0.0000 3258 = 0.0057 0789 
+/0.0000 3259 = 0.0057 0877 
+/0.0000 3260 = 0.0057 0964 


Hence +/z is good only to five places, is uncertain in the sixth place, and 
is erroneous in the last two places. However, the computer will ¢ 
these last two places, and the error may be magnified in subseque 
calculations. 
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Another example occurs in computing, say, sin 314,159.3. For if we 
reduce the angle to one whose magnitude is less than 7/2, we find 


sin 314,159.3 = sin (314,159.3 — 314,159.27) = sin 0.03 = 0.03 


If the last figure of the original angle were 2 instead of 3, we would find 
sin 314,159.2 = —0.07. If 4, we would find sin 314,159.4 = +0.18. 
Hence, even though the original angle is known to within seven sig- 
nificant figures with an uncertainty of 1 in the last figure, the result is not 
good to within even one decimal place. 

The following quotationt summarizes the general attitude that should 
be maintained in performing numerical computations on a digital 
computer: 


Despite the availability of high speed computers, a successful computation 
depends on a judicious combination of mathematical analysis and numerical 
experimentation. In the actual programming of his problem, the numerical 
analyst should not use subroutines and automatic codes blindly, but should 
ascertain that the logical decisions made in them will be consistent with the 
required round-off error and tolerance limits. The possibility of pitfalls will 
always be present in spite of all efforts to avoid them. It is therefore important 
that the numerical analyst interpret his answers according to their true meaning. 
Although the tools of the numerical analysis are all based on rigorous mathemati- 
cal theorems the application of these theorems requires discretion. In carrying 
out a numerical computation program it is often necessary to use experimental 
techniques to establish the correct operational procedures. 


9-12. Exercises 


9-1. Solve 
3 
—18 


521 = 222 
42, — 522 


by following in detail the steps of the flow diagram, Fig. 9-1, keeping track of all the 
tallies. 


9-2. Solve 
O-a +422 + 23 — 34, = 1 
4a, + 0-2. — 223 + 2a, = 14 
82; — 7x2 + a3 — 32,4 = —12 
— 52, + 872 + 0-23 — x4 = -2 


Solution: 21 = 8, 22 = 7, x3; = 27, xy = 18. 

9-3. The flow chart of Fig. 9-1 does not account for zero coefficient in the leading 
term of Eq. (j)'. How can the flow chart be adjusted to account for this? 

9-4. Using the three-address instructions of Chap. 3, write a code that solves simul- 
taneous linear equations. 


{ Irene A, Stegun and Milton Abramowitz, Pitfalls in Computation, J, Soc, 7nd. 
Appl, Math,, vol, 4, no, 4, December, 1956, 
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Solve by successive approximations, to within 0.001 (decimal): 
9-5. 1.5 cos x + 22 — 2 = 0. 
9-6. sin z — tan? xz = 0. 
1 — 2? x : 
9-7. tag at = 0. (Hint: There are four solutions.) 
1 — 2? 


9-8. — tan? a = 0. 





Consider the equation 2 = 14(e** — e-%) = sinh az. Take xz) = 1, and show by 
direct computation: 

9-9. Ifa = —1.2, the ordinary iterative process will be of type 2 but the accelerat- 
ing process will converge. 

9-10. If « = 0.5, the ordinary iterative process will be of type 3 but the accelerating 
process will make the convergence more rapid. 

9-11. If « = 1.2, the ordinary iterative process will be of type 4 but the accelerating 
process will converge. 


9-12. In Fig. 9-6, on the right-hand sides of the equations for m and xe 41 We must 
use 2,41 since z*,, is not yet calculated; however, x* and 2*_, can be used. Why, 
then, does x, rather than x* appear in the numerator of the expression for m? 

9-13. In the second series expansion given in Sec. 9-3 for arctan 2, how many 
terms are needed to determine arctan 0.5 to within +0.001? 

9-14. Using the continued-fraction expansion of arctan x, compute the value of 7 to 
seven significant figures. (Hint: arctan 1 = a/4.) 

9-15. Draw a flow chart for evaluating the nth convergent of a continued fraction, 

9-16. Construct a difference table for the polynomial 


y = 2x4 + 3x? + 5a +2 
with h = 1. 
9-17. Introduce an error ¢ = 2 in one of the values y in the above polynomial 
recompute the difference table, and observe how the error propagates. 
9-18. Show that f0,1,2,3(2) actually does pass through all the points (r0,Yo), (t1,Y1 
(a2,y2), (@s,ys) and is of degree 3. ' 
9-19. Show that 
foalz) 20 — fo2(z) to — 2 
m fia(a) t2 — & fto(a) v1 — | 
fo,1,2(@) = = 
Le — Xo Y1 — ro 











9-20. Compute arctan 0.9, where arctan z is given by a table for values of x diff 
ing by 0.4, as shown: 





Xi arctan 2; Solution 
zy = —1.0 —0.7854 -1.9 
a = —0.6 —0.5404 —1.5 0.3783 
a. = —0.2 —0.1974 —1.1 0.6111 1.2515 


xz; = +0.2 +0.1974 —0.7 0.7707 1.1141 0.8736 
x, = +0.6 +0.5404 —0.3 0.7890 0.8917 0.7568 0.6691 
zs = +1.0 +0.7854 +0.1 0.7068 0.6863 0.7334 0.7509 0.7 








9-21. Compute arctan 0.75 from the table values given in the previous exercise, 
9-22, 9-28. Write a code to evaluate each of the best-fit approximations to 
x given in Sec. 9-5, 
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The following table gives the values of arctan x for equally spaced points: 


zi arctan 2: 
ay =0 0.00000 
a =0.2 0.19739 
rm =0.4 0.38051 
az; = 0.6 0.54042 
a, =0.8 0.67474 
a, = 1.0 0.78540 


0.8 
9-24. Evaluate ic arctan «dz by the rectangular rule, first for partition at 


Xo, 22, and a4, and then for partition at xo, 11, £2, Xs, and x4; compare the results. 
9-25. Evaluate 1 mo arctan x dz by the trapezoidal rule, first for partition at xo, 
xe, and a4, and then for partition at ro, 21, X2, £3, and 24; compare the results. 
9-26. Evaluate tee arctan «dz by Simpson’s rule, first for partition at xo, 22, 


and a4, and then for partition at ao, 21, 22, 23, and x4; compare the results. 


9-27. Integrate each of the three polynomial approximations for arctan x given 
in See. 9-5, evaluate each of these integrals from 0 to 0.8, and compare the results 
of each with the results of Exercises 9-24 to 9-26. 

9-28. Observing that (the indefinite integral) 


| arctan x dx = x arctan x — 14 In (1 + 2?) 


0.8 
find Is arctan x dz; compare this result with those of Exercises 9-24 to 9-27. 


9-29. Write a code for evaluating Simpson’s rule, assuming that f(x) is evaluated 
hy a subroutine. 

9-30. Determine the derivative of arctan z at « = 0.6, using first Ar between the 
two points at x = 0.4 and z = 0.6, and then between the points at z = 0.6 and 
c= 0.8. 

9-31. Determine the derivative of arctan x at z = 0.6 by using the three points 
first at zy) = 0.6, 21 = 0.8, and a2 = 1.0, then at a = 0.4, m1 = 0.6, and x2 = 0.8, 
and finally at ro = 0.2, 2; = 0.4, and a, = 0.6. 

9-82. Differentiate each of the three polynomial approximations for arctan x 
given in Sec. 9-5, and in each case evaluate the derivative at x = 0.6. 

9-33. Using the form 


d 1 
ae arctan x = T+2 
evaluate the derivative of arctan x at x = 0.6, and compare this with the approxima- 
tions of Exercises 9-30 to 9-32. 
9-34. Show that, if (xo,yo), (v1,y1), (%2,Y2), and (23,y3) are four points with equally 
spaced abscissas, then 


re h 
[ ydx = = (—yo + 138y1 + 13y2 — ys) 
2 24 


(Hint: Let ao = —3¢h, a. = —h/2, tg = h/2, and zs = gh; let the four functions 
by@ly=ay = ef, andy = «,) 
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9-35. If five points with equally spaced abscissas are given, (0,Yo), (21,y1), (@2,Y2), 


x3 
(v3,y3), and (a4,y,), then find ib y dz. 
9-36. Derive the formulas given in Sec. 9-7 by the method of undetermined 
coefficients. 
9-37. If four points with equally spaced abscissas, (20,Yo), (1,41); (ao,y2), and (23,Yys), 
are given, find formulas for 


#) ) w) av) 
dx] r=xo dx) eux, dx) x ax5 dx) cox; 


9-38. Find the formula for ‘ie y dx in terms of the two points (2o,¥yo) and (21,y1), 
0 


and of the derivatives at these points, 


ee | o— a) 
= (os ead aa dt) rox, 


(Hint: Let a = 0, 21 = h; let 


is y dx = Ayo + Ais + Boys + Bry 
where Ao, A1, Bo, and B, are undetermined coefficients. Now, when the functions 
y=ly=2,y =2, andy = are substituted successively into the equation, there 
result four simultaneous equations that determine Ao, A1, Bo, and B,.) 

9-39. Solve dy/dz = 1+ y? for h = 0.01 and h = 0.001 by the methods of Sec. 
9-9, and compare with a table of tan 2. 

9-40. Solve dy/dx = +/1 — y? by the methods of Sec. 9-9 with intervals h = 0.1 
and 0.01, where zo = 0, yo = 0. Compare the solutions with a table of sin x. 

9-41. Draw a flow chart for Runge’s method for solving differential equations. 

9-42. Write the difference equations corresponding to 

2 2, 2 
ae a = —Anp(z,y) and a = oon 

9-48. Using the boundary values given in Fig. 9-23, solve Laplace’s equation 
the relaxation method for 4 and then for 25 interior lattice points. 

9-44. Tell how many significant figures are in each of the following numbers, & 
round each to be correct to five significant figures (or bits): 


2.7182818 1.11001110 (binary) 
7.3890568 .00110010 X 2-3 (binary) 
0.3183099 101011.00 (binary) 
0.3678794 | 0011001100 Xx 23 (binary) 


1010101010 (binary) 
11.001100 (binary) 


484,813,681,110 X 107° 
154.1500 X 103 


9-45. Calculate the maximum relative error of each of the above numbers, rounded 
to three significant figures (or bits in the case of binary numbers). 

9-46. If all the numbers in a computation were kept to five significant figures, wha 
is a bound on the relative error of each number? If these numbers were binary nul 
bers, what is the bound on the relative error? 

9-47. If it is desired that all numbers involved in a computation have a reli 
error of less than }é90, how many significant figures should each number contain? 
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9-48. In a computation involving only binary numbers, if it is desired that all 
numbers should have a relative error of less than 1é099, how many significant bits 
should each number contain? 

9-49. General Formula for Measurement Errors. Suppose that N = f(r1,22, . . . ,2n) 
denotes any function of n independent variables x1, t2, . . . , tn that are subject to 


the experimental errors Ar:, Ave, .. . , At. Show that a formula giving the error 
in the function N is 
oN oN oN 
AN Saat + 55, O78 + sae + 3n, 478 


Suppose that the error of the independent variables is not actually known, but it is 
known that the probability distribution of the errors is normal (Gaussian). Then we 
can define the probable error of a single measurement as a value such that half the 
errors of a series of measurements will be greater, the other half less. If R, ri, re, 

- » fn are the probable errors, respectively, of N, 21, #2, . . . , tn, then show that 


oN\? aN\2 aN\2 
r= V(X n+ (5) ret. +(GE) ne 


For further discussion see Scarborough (see the References below), chaps. XIV—XV, 
and E. Frank, ‘Electrical Measurement Analysis,” chaps. 5-7, McGraw-Hill Book 
Company, Inc., New York, 1959. 

9-50. Error Formulas for Differentiation and Integration. Milne (‘Numerical 
Calculus,”’ pp. 108-114; see the References below) gives a method for deriving the 
error formula for any differentiation-integration method that may be developed by 
the method of undetermined coefficients. If Q(f) is the integration or differentiation 
to be performed on the function f and P(f) is the formula developed by the method of 
undetermined coefficients for its approximation, then the error R(f) = Q(f) — P(f). 
R(f) is said to be of degree n when R(x”) = 0 for m <n, but R(x*!) #0. Also 
let (c — s)" be defined by (2 — 8s)" = (x — s)"if x >s and (a — 8s)" =Oifa<s. 
In terms of this definition let 








G(s) = 5 RAG — 9" 


in which F,[(z — s)"] means R[(a — s)"] regarded as a function of the variable x. 
Then it can be shown that 


RD = [7 FEE) ds 


where f+! is the (nm + 1)st derivative of f (see Milne for an example). Using this 
technique, derive error formulas for the integration and differentiation formulas of 
Sees. 9-6 to 9-8. 
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CHAPTER 10 


BOOLEAN ALGEBRA 


10-1. Introduction 


There is perhaps too little appreciation of the great number and variety 
of problems that can be attacked and solved by methods of the propo- 
sitional calculus of symbolic logic and the related Boolean algebra, which 
is the calculus of sets and classes. Direct application of logic can always 
be an aid to deductive reasoning—such as determining consequences 
of given premises, rules, or axioms and making hypotheses or theorems 
from which the given premises or factual relationships can be deduced. 
Besides the use of logical propositional methods in problems concerned 
with sentences, such as analysis of military information reports and legal 
and insurance documents, there appear to be even more important appli- 
cations to fields of operations research, biology, medicine, design of experi- 
ments, etc., where the utility of symbolic logic per se is not so immediately 
evident. In this chapter we present digital methods which can aid in 
making many types of logical problems computationally feasible. It is 
hoped that these methods can be directed toward the successful solution 
of many such problems—two illustrative examples being given at the end 
of this chapter. The digitalized methods which use the binary number 
system provide easy mechanization on existing high-speed electronic 
computers. 

Historical Background. It is of historical interest to note that this 
algebra was first investigated by George Boole (1815-1864) and the theory 
essentially completed by Ernst Schroeder in his monumental work on the 
algebra of logic before the turn of this century. Boole, Schroeder, and 
other logicians of that time had in mind the practical uses of the algebra 
of logic in the solution of everyday problems. However, they found that 
dealing with just four to five variables made practical computation 
unfeasible, requiring the employment of extremely ingenious methods for 
solution. Hence interest in solving large-scale logical problems of this 
kind was lost. 

It was at that time, too, that Whitehead and Russell, Hilbert, and 
others realized that logie could be developed to encompass the foundations 
of mathematics, and after the turn of the century the attention of 
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logicians became focused on the so-called functional calculus of symbolic 
logic. Problems of extreme metamathematical and philosophical impor- 
tance were considered. Such problems are still today of primary interest 
to professional logicians. Their methods have necessarily been esoteric 
and have as yet found little everyday application. 

On the other hand, the advent of digital-computer circuitry brought a 
widespread reawakening of interest in the algebra discovered by Boole. 
Computational difficulties again limited the direct, practical application 
of Boolean algebra. To help offset some of these difficulties, digital com- 
putational methods have recently been developed, and some of these are 
presented in this chapter. 


10-2. Definition of Logic and Propositions 


Symbolic Logic. Logic is the science that teaches how thinking or 
reasoning should proceed; it produces the results of thinking; and it 
encompasses laws by means of which a judgment can be passed as to 
the correctness of the products of thought or reasoning. As H. Reichen- 
bach points out, however, “When we call logic analysis of thought, the 
expression should be interpreted so as to leave no doubt that it is not 
actual thought which we pretend to analyze, it is rather a substitute for 
thinking processes, their rational reconstruction, which constitutes the 
basis of logical analysis.” In other words, once a result of thinking or 
reasoning is obtained, a chain of thought can be constructed from the 
initial premises to the final conclusions and an analysis of this rational 
reconstruction of thinking reveals those rules called laws of logic. Logie 
presents a means by which thinking or reasoning may be simulated and 
the results of thinking obtained or checked. 

Only when the results of reasoning have been stated in terms of sen- 
tences can they be tested or evaluated; logic deals with sentences (e.g, 
statements) and the rules correlating premise and conclusion sentences, 
The best method of handling these sentences in logic is a symbolic method, 
for while simple logical operations can be performed without the help of & 
symbolic notation, the structure of complicated relations cannot be seen 
without the use of symbols. 

The Proposition. The methods of symbolic logic that are concerned 
with deduction must lead from true sentences to true sentences. Hence 
the whole sentence becomes a fundamental unit for logic, since only & 
whole sentence can be true or false. In fact, attention in this book will 
be limited only to whole sentences with this property, and such sentences 
will be called propositions. A proposition is a whole sentence about whieh 
at zs meaningful to say that its content is true or false. (Some authors 
this somewhat differently, limiting their consideration only to meaning 
sentences. In their system a sentence is meaningful only if it is verifab 
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as true or false.) The part of symbolic logic that deals with propositions 
is called the propositional calculus. 


Examples of propositions and their truth values are as follows: 


Proposition Truth value 
a, 3 is an even number............000 0000 c cece ccc cece cece ee. False 
b. The grass is green... 0.2... ccc cece cence nee neeneee True 
c. Logie is the science that teaches how thinking or reasoning should 
PEOCCED fet it $5.0 418 G Be eee hig wha fos vee ees Wee ae MT een ae True 
d. The proposition on the line below is true........................ False 
e. January is a summer month..............000.0 000000 cece cece cece False 


In textbooks dedicated solely to symbolic logic a distinction in sym- 
bolism is made between propositions and propositions about propositions. 
However, since the purpose of this chapter is only to introduce computa- 
tional methods in the propositional calculus of symbolic logic and not to 
expound the subject itself, no consistent attempt will be made to separate 
propositions of these two types. The distinction between propositions 
of these types is to be made by the reader from the context. 

The Propositional Operations. It is to be reemphasized that this 
chapter deals with a logic of two truth values, viz., true and false, and 
that the primary interest is in determining the truth value of propositions. 
In considering combinations of propositions, the truth value of the com- 
bination is again the primary object. 

Propositions can be combined or manipulated by the words and, or, 
and not (negation), these words being called the propositional operations. 
The operations and and or combine two propositions, whereas the opera- 
tion not (or negation) is associated with only one proposition. Given 
the truth value of propositions, the truth values of their combinations 
with the propositional operations are defined as follows: 

1. The and combination of two propositions is true if and only if 
both propositions are true. If either one or both of them are false, the 
combination is false. For example, ‘The grass is green, and logic is the 
science that teaches how thinking or reasoning should proceed” is true, 
but “The grass is green, and 3 is an even number” is false. The and 
combination of two propositions is often called their conjunction, or 
logical product. 

2. The or combination of two propositions is true if and only if either 
one or both of the propositions are true. The or combination of two 
propositions is false only if both propositions are false. For example, 
“The grass is green, or 3 is an even number” is true; “3 is an even num- 
ber, or January is & summer month” is false, » The or combination of two 
propositions is often called their diajunetion, or logical sum, 
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3. The negation of a proposition is true if and only if the proposition 
is false. For example, ‘‘3 is not an even number” (which is the negation 
of “3 is an even number’’) is true. 

Propositions may be symbolized by letters: A, B, C, . weg jos 
Y,.... In later sections we shall symbolize propositions by letters 
with subscripts: Ai, Ax, ..., Bi, By... - We use the symbol : for 
the operation and, + for or, and ~ (bar) over the letter to indicate nega- 
tion. This symbolism is summarized in the following table: 


Sm 

















Operational symbol | Interpretation | Terminology 
Not A Negation 
AorB Disjunction, logical sum 


+B ; 
-B A and B Conjunction, logical product 


The use of + for the or symbol arises from the parallelism between this 
logical operation and arithmetic addition. For example, 


2 bananas + 3 apples = 5 pieces of fruit 


where pieces of fruit = bananas or apples. The parallelism between the 
logical product and the arithmetical product also holds; for example, 
2 ft -3 lb = 6 ft-lb since the unit of work, namely, the foot-pound, is 
a logical and combination. _ 

Boolean Functions. Combinations of propositions joined by or, not, 
and and can be determined true or false and hence are propositions them 
selves. The uncombined propositions are called elementary elements and 
are usually denoted by letters at the beginning of the alphabet : A, B, 

Propositions combined by the propositional operations are called 
combined elements, or Boolean functions, and are usually denoted by letters 
at the end of the alphabet: X, Y, . . . . Combined elements are somes 
times written in functional form, that is, f(A,B, . ._.). 

For example, consider the combined element A+B-C. Suppose 
that A is false, B is true, and C is false (that is, A might represent propos 
tion a of the above examples, B might represent proposition 6, and C 
might represent proposition ¢). In such a case B-C is false (singe 
C is false), and A is true (since A is false); hence A + B+ C is trug 
(since A is true). Note that A - B is distinct from AB; the former 
read (not A) and (not B), while the latter is read [not (A and B)) (t 
parentheses are for clarity). If A is true and B is false, then A 
false and A - B is true. 
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10-3. Boolean Expressions 


There are two particularly important combined elements, namely, 
A+B and A-B+A-B. Their significance and meaning are dis- 
cussed here. 

Implication. First consider A + B. This is usually abbreviated 
A— B, called ‘‘A implies B,” or “If A, then B,” by logicians. In a 
sense this corresponds to the everyday linguistic use of the word implies, 
for if A is true and A — B is true, then B must be true; also, if A — B is 
true and B is true, A may or may not be true. For example, consider 
the proposition “if an airplane is a fighter plane, then it has light armor.” 
Certainly, if the airplane is a fighter, it will have light armor, according 
to the proposition; but if the airplane has light armor, the proposition 
does not reveal whether it is a fighter or not, for other types of airplanes 
than fighters have light armor. 

Equivalence. Consider now the important function A4-B+ A-B, 
called A is equivalent to B” and abbreviated A = B. If A-B+A-B 
is true, then A and B have the same truth values, for then either A and B 
are both true or they are both false. Also, the meaning of the equals (= ) 
sign in logic parallels its meaning in arithmetic, in which any item can 
be substituted for an equivalent one. For it can be shown that, if 
A = B is true, then some combination of the propositions U, V, and A, 
say f(U,V,A), is equivalent to that combination with B replacing A, 
that is, f(U,V,A) = f(U,V,B). In other words, it is always true that 
(A = B)—>[f(U,V,A) = f(U,V,B)] in the propositional calculus. 

Tautology. This naturally brings up the topic of tautology. A 
tautology is a combined element that is always true, independent of the 
truth or falseness of its component propositions. For example, A + A, 
(A-B)+ A, (A-B)+ (A+ 8)-(A + B), as well as 


(A = B) > [f(U,V,A) = f(U,V,B)] 


are all tautologies, for they are true no matter what the truth values of 
A, B, U, and V are (or what the truth value of the combined element 
fis). Therefore there is no logical reason to distinguish between these 
combined elements, and any one of them is called the universally true 
clement, or tautology, and denoted by I. “The negation of J, namely, J, 
is the universally false element, denoted by 0. A consequence of these 
concepts is that 0 > X and X — TI are always true; i.e., the universally 
false element implies everything, while everything implies the tautology. 

To review, then, there are two meanings to truth. One is “tauto- 
logically true,”’ and the other is “true because it is a physical fact or 
supposition,”’” “The grass is green” is true in the latter sense; “Today 
is ‘Tuesday, or today is not Tuesday’ is tautologically true, Logicians 
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often use a special symbol to distinguish these meanings, but for our 
limited purposes this would be an unnecessary complication. 

Equations of Boolean Algebra. With the foregoing interpretation the 
algebraic properties of the logical operations can be more clearly under- 
stood. The following equations characterize a Boolean algebra. It is 
to be noted that these equations are quite different from the familiar 
rules of ordinary algebra. The subject that comprises algebraic manipu- 
lations according to the equations given below is therefore called Boolean 
algebra to distinguish it from ordinary algebra. Some of the equations of 
Boolean algebra are particularly important, and mathematicians have 
given them special names, as shown: 


Name of equation 





1.A+tA=A Absorption rule for + 
2.A-A= Absorption rule for - 

3. A+B=B+A Commutative rule for + 
4,.A4-B=B-A Commutative rule for - 
5 (A+B)+C=A+(B+C)=AtBH+C Associative rule for + 
6 (A-B)-C =A-(B-C) =A-B-C Associative rule for - 
7.A-(B+C) =A-BH+A:C Distributive rule of - over + 
8 A+B-C=(A+B)-(A+C) Distributive rule of + over: 
9 A-B=A+8B De Morgan’s rule for - 
10. A4+B=A-B De Morgan’s rule for + 
11.A+7=1 

12. A-I=A 

13.0+A=A 

14.0-A =0 

15.4 +A =I 

16. A-A =0 

17.A+A-B=A 

18. A+A-B=A+B 

1. A-B+B-C+C-A=(A+B)-(B+C)-C +A) 

20. A-B+A-C+B-C=A-B+B-C 

21.4 =A 


These equations are to be interpreted as tautologies. The equals sign 
means that, for any given combination of truth values for A, B, and C, 
the combined element on the right-hand side of the equation has the same 
truth value as the combined element on the left-hand side. Hence these 
equations can be checked by writing out truth values for the right-hand 
and left-hand combined elements. 

Algebraic Proofs. Some of these equations can be “proved” alge= 
braically by using others. For example, let us prove (20) by using 
(12), (15), (7), (6), (4), (8), and (17).f 

+ Note how difficult and rather “tricky’’ it is to prove (20). With experience and 
practice some facility can be gained in performing such manipulations, However, the 
student need not dwell on gaining such facility, for in Sec, 10-5 a systematic and 
straightforward method will be given for doing such problems, 
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A-B+A:C+B-C 

=A-*B+A-C-I+B:C by (12) 
=A-+B+A-C-(B+B)+B-C by (15) 

=A~ Ba AsO) iA Os Bot Be by (7) 
=A-B4+(A-B):-C+B-C4+(B-C):A by (6), (4), and (8) 


=A:B+B-C by (17) applied twice 


New equations may also be proved by using those listed above. To 
show that J- A = A, we have 7: A = A-I = A by (4) and (12). 

There is another way of proving equations, namely, by making use of 
(15) and (11), for we want to show that an equation is a tautology (i.e. 
the meaning of ‘proving equations’). If we can show that the Coun 
tion itself is equivalent to J (in truth value), then we have shown that 
it is a tautology, since J is the universally true element (i.e., always true 
for all truth-value combinations of the elementary elements). 

For example, consider the proof of (17): A+ A-B = A isan abbre- 
viation for 


(A+ A-B)-A+(AFA-B):A 


but, on using (2), (3), (4), (6), (7), (9), (10), (11), and (15), we find that 
this expands to 


A+A:-B+A4-:(A4+B)-A=A+A-B+A4+A'B 
=A+A+A-B+A-B 
=7+A:B+A-B by (15) 
=f by (11), to complete the proof 


As another example, we can show that A+: B— A is a tautology, for 
A B= A is an abbreviation for A- B+ A, which becomes, by (9) 
A+B+A. But A+ 8+ A=A+A+4B [by (3) and ()] = 
I + B [by (15)] = J [by (11)], to complete the proof. 

There is still another way for proving equations, i.e., to make both 
sides of the equals sign look identical. For example, let us prove that 
(A— B) = (B— A). From the definition of > we have 4 + B= 
B+A or A+B=A+4+B, by (21) and (38). As another example, 
let us show that 


A> (B>-C)=A:B-C 


By the definition of — the left-hand side becomes A + B + C. Simi- 
larly the right-hand side becomes A-+-B+C; by using (9) this becomes 
A + B+.C, as desired. 

Other Logical Operations. It should be observed that Boolean algebra 
can be accomplished with just two operations, + and ~, since the - oper- 
ation can be given in terms of + and ~ as a+b = & +5. There are in 
fact. single operations in terms of which Boolean algebra can be accom- 
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plished. Consider the operation p,(a,b) defined in terms of + and ~ as 
follows: pi(a,b) = a + b. To show that Boolean algebra can be accom- 
plished in terms of this single operation, all we need do is define +, -, and 
—in terms of pi(a,b). Thus, as can easily be checked from our definition, 
a = pi(a,a); a + b = pilpi(a,b),p.(a,b)]; and a+b = pilpi(a,a),pi(b,b)). 
Another such single operation is po(a,b) =a-b. Here we have 
a= p2(a,a), a + b= plp2(a,a),po(b,b)], and a:b = p2|p2(a,b),p2(a,b)]. 
Note that J and 0 can be defined in terms of p; or po. For p: we have 
I = pi{pila,p.(a,a)],pi[a,pi(a,a)]}, and then 0 = p,(J,J). Can you define 
7 and 0 in terms of p2? If we admit J and 0 as primitive concepts, then 
there are four further single operations that can be defined, namely, 
p3(a,b) =4 tr b, ps(a,b) =at+ b, ps(a,b) = a° b, pe(a,b) =a°* b. 
For example, for ps; we have @ = p3(a,0), a+ b = pslps(a,0),b], and 
a+b = ps{psla,p3(b,0)],0}. Can you show that ps, ps, and ps are each 
also valid single operations? What algebraic properties do pi, . . . , De 
have? The single operation p; was first introduced by H. M. Sheffer. 


10-4. Representations of Boolean Expressions 


Sets. We have introduced Boolean algebra as the algebra of propo- 
sitional logic. However, Boolean algebra can also be interpreted as the 
algebra of sets, or classes. A class is a collection of objects all of which 
have a certain property; such a well-defined collection of objects is often 
called a set. For example, the collection of all automobiles manufactured 
in the year 1960 and still in operating 
condition is a set. Or the collection 
of people who registered at the 
national meeting of the Institute o 
Radio Engineers in 1960 forms a set. 
However, most often for the pur- 
poses of mathematics a set is visua 
lized as the area inside a closed 
curve drawn on a piece of paper) 
the set is roughly interpreted as the 
collection of points that lie inside the closed curve (see Fig. 10-1). The 
objects that belong to the set are called elements of the set. For example, 
the point p is an element of the set (the interior of the circle) of Fig. 
10-1. Sets are often denoted by letters; for example, we call the set in 
Fig. 10-1 the set A. 

Just as the prime purpose of propositional logic was to determine the 
truth value of a proposition, so the prime purpose of the algebra of sets is 





Fig. 10-1. Sets A and A. 


+ A Set of Five Independent Postulates for Boolean Algebras, Trans. Am, Math, 
Soc., vol. 14, pp. 481-488, 1913, where pi(a,b) was denoted by a|b, commonly known 
Sheffer’s stroke. 
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to determine whether or not a point is in a set. In fact the only difference 
between the propositional calculus of logic and the algebra of sets is that 
the algebra of sets deals with a very restricted kind of propositions. 
These are propositions of the form “The point p is in the set A” and 
combinations of such propositions with and, or, and not. Hence the 
hatched area shown in Fig. 10-1 represents all those points for which the 
proposition ‘““The point p is in the set A” is true. Let us symbolize this 


proposition by A. Then A is true for all points p not lying in the 
hatched area. 

Intersection and Union. All the points for which A - B is true (i.e., 
for which “‘the point p is in the set A, and the point p is in the set B” 
is true) are represented by the hatched area shown in Fig. 10-2. All the 
points for which A + B is true (i.e., for which “the point p is in the 


9 





Fig. 10-2. Set A- B. 


Fig. 10-3. Set A + B. 


set A, or the point p is in the set B’’ is true) are represented by the hatched 
area shown in Fig. 10-3. The hatched area shown in Fig. 10-2 is called 
the intersection of the set A with the set B, that is, the set of points that 
lic in both A and B. The hatched area shown in Fig. 10-3 is called the 
union of the set A with the set B, that is, the set of points that lie in 
cither A or B or both. 

Now, as we have seen, Boolean algebra holds for propositions, and 
hence it must also hold for our set interpretation of propositions. Hence 
we can visualize some of the relations of Boolean algebra by means of 
the set diagrams. For example, consider the relation A-B = A + B. 
We shall show that A- B and A + B represent the same area. Figure 
10-4 gives the area for A, Fig. 10-5 for B, and Fig. 10-6 for A + B. 
But, observing Fig. 10-2 for A - B, we see that the area for A + B is the 
same as that for A+ B. 

Just as in the propositional logic, given the truth value of the ele- 
mentary elements, the truth value of a combined element can be deter- 
mined, so, given the location of a point p with respect to the elementary 
sels A, B, .. . , we can determine if it is in the set described by a com- 
bined element, For example, if a point pis in set A, not in set B, not in 
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set C, then is it in the set for which (A + B) - C is true? The solution ’ 
een in Fig. 10-7. The hatched areas are all those points for whic 
(A + B) - C is true, and the point p is in set A, but not in set B or set C. 
Hence p is in the set (A + B) °C. 

These diagrams representing sets by intersecting areas are called ee 
diagrams (attributed to Venn by Schroeder). Figure 10-8 is such a dia- 
ram for four variables. oe 
i Logical Translation of English. The limitation on the practical appli- 
cation of the propositional calculus arises from the difficulty of trans- 
lation of the practical worldly situations into these precise statements. 


yy 
Qa 






















cS \ | 
Sf 


Fig. 10-7. Set (A + B)- C. 


Fia. 10-6. Set A + B. 


This translation, the first step in formulating the problem for o- 
is by no means obvious and requires the work of a trained logical analy 
Part of this difficulty is the prevalent loose and often Oo ee 
language, and in particular of those very conjunctions which form 
i erations. fl 
ge ener the word unless as used in the combined propositiom 
unless B” is defined by logicians as A + B, since A unless a 7 00) 
sidered equivalent to ‘‘A if not B,” that is, “af not B, then ne f q 
written symbolically as B— A. However, since not all users of the 
unless are familiar with the logicians’ delineation of the term, one 0 
finds it employed with an entirely different connotation, lor instar 
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“A unless B” is often used to denote “If B, then not A,” symbolically 
B— A, or B+ A. A third possibility for “A unless B” might be “A, 
but if B then not A,” which becomes A - (B+ A), or A-B. It is thus 
evident that in the formulation of the propositional elements from the 





Fig. 10-8. Four-variable Venn diagram. 


original raw sentences one must have enough familiarity with the subject 
matter to be able to discern the intent of a sentence in its given context. 


In Table 10-1 we have suggested some translations for various English 


connectives. Caution must be used for the translation of or to distinguish 


TaBLe 10-1. TRANSLATIONS OF ENGLISH CONNECTIVES 


English connective Logical translation 


NOt Ais Mocs ticansohin ouvienvbaieléd beciee. oe A 

A and Bis ccc sciced vas oecle da sites Seca unr A-B 

A or (inclusive) B; A or B or both............ A +B 

A or (exclusive) B; either A or B............. A#B,A-B+4A-B 
A DUG Be: csgeailelcinng coho stees oer i s vs, ae A-B 

A although B..... 0... eee eee A-B 

A unless B..... eee ec ee cece ce. BoA,B+A 
A on condition that B...................... BoA,B+A 
OMB ies sresecttebn WS cathee en as bat cade BoA, B+A 
If A, then B; A implies B................... A>B,A+B 
A OWN Bihisccesoceg ie 0ae fe te bahadaee ecee: A+B,A+B 
Not unless A then B........................ A> B,A+B 


A provided that B 
A as wellas B.....................0.0..... A-B 


tra apa hetsa banda Ghana A=B,A-B+A-B 
Not both A and B...................... 


_A-B,A+B 
Neither A nor B.............0.. 0.000 0000.. A-B,A +B 
When A, then B..............0.0.000000... A>B,A+B 
A\DOCMUBO BD toh PTS cates flask ons cgicteiswcua BoA,B+A 


} Often imprecisely synonymous with “A if and only if B.”’ 


between the inctusive or, that is, A or B or both, and the exclusive or, that is, 
vither A or B but not both, Also care must be observed in using “If A 
then B,” or “A implies B.” Strict logical interpretation is A — B, or 
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A+B. Consider, for example, A— B- C. This is equivalent to 
(A— B):-(A— 0), which is reasonable in terms of the English trans- 
lation. Similarly A — (B + C) is equivalent to (A > B) + (A C), 
which again is reasonable. However, A:-B—C is equivalent to 
(A—>C)+(B—C), which upon English translation is not obvious 
without considerable reflection; i.e., “Both A and B imply C”’ is the 
same as “A implies C, or B implies C, or both A implies C and B implies 
(Similarly (A + B)— C is equivalent to (A> C)- (BC), which 
again is not particularly obvious upon English translation. 

Other Representations. A geometric representation of a Boolean alge- 
bra may be obtained by considering the vertices, edges, faces, etc., of a 


A:B:C 
111) 





Fg. 10-9. Two- and three-dimensional hypercubes. The numbers represent b 
columns (see Sec. 10-5). 


hypercube, as in Fig. 10-9. For two variables we obtain a square ( 
Fig. 10-9a), where the vertices are the elementary products—that 
products of all the variables, barred or not (see below)—and the ed 
are the functions formed by the sum of the adjacent vertices. For th 
variables we obtain a cube, where now the faces are the sum of t 
adjacent vertices (or edges), and so forth (see Fig. 10-9b). In Fig. 1 
we have labeled the edges; can you label the faces? 

Another representation is a Boolean lattice diagram, illustrated in Vr 
10-10, for a two-elementary-element system. A point Y on the lattice 
said to include another point X on the lattice if a continuous upward pil 
can be found from X to Y along the lines drawn on the lattice diagr 
For example, on our lattice, A + B includes A - B; A + B includes 
and so forth. One point of the lattice is said to be below (above) an 
point if it is on a row below (above) the row of the other point, 
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ee on our lattice, A - B is below B, A + Bis above A-B+A-B, 

The lattice is so constructed that the logical sum of two elements is 
the lowest point which includes both elements. For example, A + B is 
the logical sum of A and B, A + B is the logical sum of A Band B 
ete. The logical product of two elements is the highest point which de 
included by both elements. For example, A - B is the logical product of 


A 
0101 





Fig. 10-10. Lattice diagram for two variables. 


A and B; A - B is the logical product of A + B and B. The numbers 
represent designation numbers (see below). 


10-5. The Designation Numbers 


. The truth values of a combined element depend on the truth values of 
its component elementary elements, as dictated by the definitions of the 
propositional operations. A table can be made presenting all possible 
combinations of truth values; we call this a truth table. Consider, for 
example, A + B. The rows of A and B represent all possible truth-value 
combinations of A and B, and the rows of A + B represent the corre- 
sponding truth values of A + B, where 7’ stands for true and F for false: 


A | fT FT 
B ee 
A+B) F TT TT 
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Similarly the truth tables for A and A - B are 








“A |F TT A FTF T 
7a B| ke TT 


A-B | F F F T 





iti he propositional opera- 
These truth tables follow from the definitions of t iti 
tions and, conversely, could have been taken to be the definitions of these 
operations. With these basic truth tables in mind the truth table for any 
combined element can be constructed. 7 7 
For example, recalling that A — B stands for A + B and that A = B 
stands for A: B + A - B, the following table can be formed: 


A F T F 
B F F T 


A—>B T F T 
A=B | T F F 











ie) ee | 





Note that A = B is true only when A and B have the same truth values. 
Similarly truth tables can be made for combinations of any number a 
elementary elements. For example, for three elementary elements 4, 
B, and C, the truth table for the combined element A + B-C becomes 








Fi FF TF TFT 
B FR TT FFT 
C FR PPTTTT 
AGE Ol TF OF fF TT 





Using the truth-table idea, but replacing 7 by 1 and F by 0, we ca 
form a binary number for each Boolean function. The computation 
methods to be described involve such binary numbers, called designalt 
numbers. To every element, elementary or combined, we associate 
designation number. In doing a problem, the designation-number rep 
sentation for each proposition is obtained; the computations are pe 
formed in terms of these numbers; and finally the answer is reinterpret 
in terms of propositions or sentences. The procedure for going y 
propositions to designation numbers will be considered first, and that 
returning from designation numbers to propositions will follow. 

The Basis. The designation numbers for the elementary elements} 
assigned first. Such an assignment is called a basis. One such basis f 
a system of three elementary elements ist 

0123 4567 
#A =0101 O101 


#B = 0011 0011 
#C = 0000 1111 


t Also called basic symbols or variables. 
{The # symbol means “The designation number of . . . « 
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where the upper small numbers merely number the positions of the 
columns of the basis. For a system of n elementary elements, there are 
2” digits, or bits, in each designation number and hence 2” columns in a 
basis. Any permutation of the columns of the basis shown will result in 
a different but nonetheless valid basis. The columns of a valid basis for 
n elementary elements need only represent all the 2” possible combina- 
tions of 0 and 1 taken at atime. Hence there can be 2”! different bases. 

One advantage of the type of basis illustrated above is that it may be 
written out directly for any number of elementary elements as follows: 
the number for the first elementary element alternates zeros and units; 
for the second alternates pairs of zeros and units; for the third alternates 
4 zeros and 4 units; for the fourth would alternate 8 zeros and 8 units, 
and so forth, each such designation number being extended to comprise 
the full complement of 2 bits. The column positions of the basis are 
numbered from the left, from 0 through 2” — 1. Such a basis is called a 
standard basis, and unless otherwise stated it is always assumed that all 
designation numbers refer to this type of basis. It is often convenient to 
denote this basis by b[ A,B,C], where the order of the elements in the square 
brackets is to be the same as the rows in the basis. Following this con- 
vention we have, for example, in the case of four elementary elements, 
the standard basis b[A,B,C,D]: 


of 2.3 4 5 6 7 8 9 10 11 12 13 14 15 
#A=0 10 1 0 10 1 010 1 0101 
#B=0 011 0011 0 0 1 1 001 1 
#C=0 0 0 0 1111 0 0 0 0 114141 
#D=0 00 0 0 0 0 0 I o1o1 1 1111 


Observe that each column of a standard basis, read from the bottom to 
the top, forms a binary number which is the same as the number of the 
1 


column position. For example, column 3 is 4 corresponding to 0011 = 3 
0 


0 
(decimal), and column 14 is } corresponding to 1110 = 14 (decimal). 
1 


Determining the Designation Number of a Boolean Function. In order 
to find #(A + B), we define an operation of “logical addition,” to be per- 
formed on #A and #B, wherein 0+0=0,0+1=1+0= 1, and 
1+ 1= 1 without carry. For example, with respect to the standard 
basis for two elementary elements, 


#A = 0101 

#B = 0011 

#(A +B) = O111 
To find #(A + B), we similarly define the position-by-position operation 
of “logical multiplication,” wherein 00 © 0, O+ be 1:0 = 0, and 
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1:1 =1. For example, 
#A = 0101 
#B = 0011 
#(A-B) = 0001 


To find # A, we invert each digit of # A, that is, change 0 to 1 and 1 to 0. 
4 le, 7 
For examp aaa 

With these operations on designation numbers in mind, we determine 
the designation number of a combined element merely by performing the 
indicated operations with respect to the chosen basis. For example, to 
find #(A + B-C), we have, with respect to a basis for three elementary 


ee #B = 0011 0011 


#C =0000 1111 

#(B-C) = 0000 0011 

#A = 1010 1010 

#(A +B-C) = 1010 1011 





There should be no confusion as to the dual role of the symbols +, 3a 
When applied to propositions A,B,C,... , they are the logical operas 
tions or, and, and not, respectively; when applied to designation numbe 
#A, #B, #C,..., they are the numerical operations logical add, 

J } and invert, respectively. 
a lennon g ae Sonis cinta results of this digitalization , 
propositions are: (1) #7 =1111-:: ; that is, # I has all units. ( 
#X = #Y if, and only if, X = Y. (3) X— Y if, and only if, #Y 
units in at least those positions which correspond to the units of #X. 

For example, to demonstrate equivalences, we simply cole ia ‘ 
designation number of each side; if they are identical, the equation ho 
To show that A°-B+ A:C+B-C= A-B+B-C, we find 


#(A-B) = 0001 0001 #(A-C) = 0000 0101 
and #(B-C) = 0000 1100 
whence ° #(A-B+A-C+8-C) = 0001 110! 


but #(A- B+ B-C) = 0001 1101 also, whence the equivalence mu 
hold. To show that an implication is true, simply show that the impli 
proposition has units in at least the same position as the implier. 7 
example, to demonstrate that A-B+B-C>A+4+C, we note t 
: 3.45 a ; 
#(A-B+B-C) = 0001 1101, which has units in positions 8, 4, 5, 
3.457 


7: also #(A + C) = 0101 1111, which obviously, has units in at | 
these positions. 
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10-6. Forms of Boolean Functions 


From Designation Number to Boolean Function. Given a designation 
number, there are systematic methods for finding its symbolic Boolean 
algebraic representation in several different forms. For an example of 
five different equivalent representations of the same designation number, 
consider the number 0111 0100. (1) The first canonical form of this 
isA-B-C+A-B-C+A-B-C+A-B-C. (2) Thesecond canon- 
icalformis (A + B+C)-(A+B4+6):(A+8B+4+6):(44+ B40. 
(3) The simplest sum-of-products form is A:B+B-C. (4) The 
simplest product-of-sums form is (A + B): (B+). (5) A mongrel 
formis (A + B):C+ A-B-C. Allthese combined elements are equiv- 
alent. In this section systematic methods for obtaining the first two 
and last forms are given; methods for finding the third form are given in 
the following section; the simplest product-of-sums form is left as an 
exercise. 

The First Canonical Form. The so-called first canonical form, also 
called the disjunctive normal form, is a sum of elementary products. An 
elementary product is a product that contains all the elementary ele- 
ments or their negations. For example, all possible elementary products 
for three elementary elements are A: B-C,A-B-C,A-B-C,A-B-C, 
A-B-C,A-B-C,A-B-C,and A-B-C. Observe that the designa- 
tion number of an elementary product always has a single unit: e.g., 


#(A-B-C) = 1000 0000 
#(A-B-C) = 0100 0000 
#(A-B-C) = 0010 0000 
#(A-B-€) = 0001 0000 
#(4-B-C) = 0000 1000 
#(A-B-C) = 0000 0100 
#(4-B-C) = 0000 0010 


#(A+B-C) = 0000 0001 


lor n elementary elements there are 2” such products. The first canoni- 
cal form of a designation number is the sum of only those elementary 
products each of which contributes a unit to the number. For example, 
0100 0011 has units in positions 1, 6, and 7 and can therefore be con- 
structed by summing the elementary products whose designation num- 
bers have units in positions 1, 6, and 7. Hence, we have 
#(A-B-C) = 0100 0000 
#(A-B-C) = 0000 0010 
#(A-B-C) = 0000 0001 
#(A:- B.C +A-B-C + A-B-C) =0100 0011 
i.e., the first-canonical-form representation of 0100 0011 is 


A‘B:OC+A‘B'C+A'B'C 
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The Second Canonical Form. To find the second canonical form, also 
called the conjunctive normal form, note that the designation number of 
the sum of all the elementary elements or their negations has exactly one 


zero. 


#(44+B4C) =1111 1110 
#(A4+B4+C) =1111 1101 
#(A4+B4+C) =111! 1011 
#(A4+B+O) =1111 O111 
#(4+B4C) =1110 1111 
#(A +B+4C) =1101 1111 
#(A+B4C) =1011 1111 
#(A44+B4C) =0111 1111 


These sums are called elementary sums; for n elements there are 2" 
elementary sums. The conjunctive normal form corresponding be. 7 
given designation number is the product of those elementary sums whic 


correspond to the zeros in the designation number. For example, for 
45 a 
0100 0011 we have zeros in positions 0, 2, 3, 4, and 5, and hence 


B (A+B (A+B+C)-(A+B+0O) 
#U(A+BHC)-(A4+B4+C)-(A+B+0) (At ne ito ae 
This follows because each elementary sum in the product contributes one 
of the zeros of the designation number. 


#(A +B4+C) =0111 1111 
#(A+B+4C) =1101 1111 


#(4+B4+0C) =1110 1111 
#(A +B+6) =1111 O111 
#(A +B+4+C) =1111 1011 





# (product) = 0100 0011 
“ i aie Forms. The mongrel form is not a unique form ; i.e., two peop 
determining a mongrel form for the same designation number na n 
arrive at the same Boolean function. In general, in the mee or 
anything goes, just as long as the final Boolean function actua s rep 
sents the designation number under consideration. For example, ¢o 
sider the designation number 0001 0111. We note that 
#A =0101 0101 


#B = 0011 0011 
#C = 0000 1111 


#(A-B) = 0001 0001 
#(A +B) = 0111 0111 


But since #G = 1111 0000, then #(A - B- C) = 0001 0000; also, sin 
#C = 0000 1111, then #[(A + B) - C] = 0000 0111, whence 


0001 0111 = #[A°B-C+(A+B):C| 
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or A-B-C+(A4+B)-Cisa mongrel form. The mongrel form is 
particularly useful in more complicated cases. For example, consider 
0001 0111 0111 0001 1110 1110 1000 1000. Here we have the basis 


#A =0101 0101 0101 0101 0101 0101 0101 0101 
#B = 0011 0011 0011 0011 0011 0011 0011 0011 
#C = 0000 1111 0000 1111 0000 1111 0000 1111 
#D = 0000 0000 1111 1111 0000 0000 1111 1111 
#E = 0000 0000 0000 0000 1111 1111 1111 1111 





Since 
#(A-B) = 0001 0001 0001 0001 0001 0001 0001 0001 
#(A +B) =0111 O111 0111 O111 0111 0111 0111 O111 
#(A +B) =1110 1110 1110 1110 1110 1110 1110 1110 
#A-B = 1000 1000 1000 1000 1000 1000 1000 1000 

and 

#(0-D- EF) = 1111 0000 0000 0000 0000 0000 0000 0000 
#(C-D-#F) =0000 1111 0000 0000 0000 0000 0000 0000 
#(C-D- E) = 0000 0000 1111 0000 0000 0000 0000 0000 
#(C-D-B) = 0000 0000 0000 1111 0000 0000 0000 0000 
#(D-E) = 0000 0000 0000 0000 1111 1111 0000 0000 
#(D-E) = 0000 0000 0000 0000 0000 0000 1111 1111 


we can immediately write down the following mongrel form: 
(A-B)- (6: DB) + (A+B)-(C-D-B) + (A4+B)-(6-D-B) 
+ (A+B): (C-D-E)+ (A+ B)- (DE) + (A-B)-(D- EB) 
A useful fact to remember in writing mongrel forms is that 
0110 = #(A-B+A-B) 
and 1001 = #(A-B+A4-B) 
Of course this can be generalized to the observations, for example, that 
0000 1111 1111 0000 0000 1111 1111 0000 = #(C-D+6-H) 
and 


1111 0000 0000 1111 1111 6000 0000 1111 = #(C-D+64-D) 


Applying this to 0001 0111 0111 0001 1110 1110 1000 1000, we find 
the mongrel form 


(A+B): (C-D+C-D)+(A+B)-(C-D+6-D)|-E 
+[(A+B)-D+A-B-D\-E 
The writing of “good” mongrel forms depends on ingenuity and 
practice. Of course the individual component products and sums need 
hot be written out explicitly as was done above. For example, consider 
O110 1001 1001 0110. A little thought will enable the following mongrel 
form to be written down immediately: 


(A:B+ A+B): (C+ D4+0:D) + (A: B+A:B)-(C:D+6:-D) 


ll 


ll 


ll 
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10-7. Simplest Sum-of-products Representation} 


The Role of the Prime Implicants. This symbolic i erases ie 
designation number has the property of being a sum of origi in ‘ 
least number of operations - and +. There may be several such form 
associated with a single designation number. A systematic isos can 
be described that will result in such a ae Py poner this process, 

iscuss the general nature of a product. 

ea of pies that enter into a product is called the eo 
terms in the product. Thus A: B-C is a three-term product, 
is a two-term product, and A is called a one-term product. ae 2 
for n basic symbols, the designation number of an n-term pro ie , 
2° (=1) unit, of an (n — 1)-term product the designation number 7 
2! units, of an (n — 2)-term product it has 2? units, . oo of an (n — ‘ - 
term product the designation number has 2* units. F a ee e, 
#(A-+-B-C) = 0000 0100 with 1 unit, #(A-B) = 0100 ) via 
units, and #A = 0101 0101 with 4 units. pengeheD ern = 
products that follows directly is that, if P and P’ are products suc “ 
#P— #P’, then P’ is formed from P by deleting some of the sie 
For example, #(A:8-C)— #(A- 8), the latter being forme 
deleting C from the former. The converse is evidently also true. j 

Now we can turn to the discussion of the simplest sum-of-pro "i 
form. If #F is the designation number in question, consider a iy 7 
sum-of-products representation of #F’. Clearly each product P; 7 
sum must be such that #P;— #F (that is, #P; cannot have a 8 
positions where #F has zeros), for otherwise the sum of the produ 
would not have the same designation number as #F. Also, if this oa 
indeed the simplest sum of products, then certainly no product *9 0 " 
sum can be replaced by a product P; such that P;-> P;, for t a j 
resulting sum of products would become even simpler (i.e., We a 
products would have a term deleted). In other words, in order a 
product P be considered at all for inclusion in a simplest sum-of-pr 
form for #F, it must be such that: 

Po #F. 

5 idee exists no P’ such that #P— # P’ and #P!—> #F. 

If P has the first property, it is called an implicant of F; if it also 
the second, it is a prime implicant of F. 


+ The method of this section is an adaptation of McCluskey’s variation of Qui 
method of simplification. (See E. J. McCluskey, Jr., Minimization of Boolean 
tions, Bell System Tech. J., vol. 35, p. 1417, November, 1956. See also Ww. ie Q 
The Problem of Simplifying Truth Functions, Am. Math, Monthly, vol, 59, nO, 
pp. 521-531, October, 1952.) 
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For example if #F = 0111 0100, then A-B-G, A-B- C,A-B-O, 
A-B-C, A-C, A+B, and B-G are all implicants of F, as can be 
readily checked, but only A - CA -B, and B-C are prime implicants 
of F. To show that A - B is a prime implicant of F, note that the only 
products implied by A - Bare A and B. But #A + #F, and #B +#F, 
and hence A - B is a prime implicant of F. (On the other hand, A - B - Cis 
not a prime implicant because #A-B-C— #A-Band #A-B—> #F.) 

There are two stages in determining the simplest sum-of-products form 
of F. The first is the systematic generation of all the prime implicants 
of F; the second is the selection from this collection of a subcollection of 
products the sum of which will truly be the simplest sum of products, as 
defined above. We shall now describe in detail how to generate all the 
prime implicants of a given designation number. Then we shall show 


how with the aid of a so-called prime-implicant chart the final simplest 
sum of products is chosen. 


The First Stage—Generating the Prime I mplicants. 
for generating the prime implicants can best be understood by examining 
the first canonical form. For 0111 0100 the first canonical form is 
A:B-C+A-B-C+A-B-C+A-B-C. The following reduc- 
tions can be made by combining pairs of products: 


The rationale 


ony) 


ic 
83 


A-B-(64+0)=A-B-I=A-B 


A-B- 
{-B- (A+ A)°B-C=1-B-G=B-6 


tb 


+A 
+A: 


a 
QQ 


Il 


Thus the first canonical form reduces to A - B +B-C. We can deter- 
mine whether or not two products can be combined by comparing the pair 
of associated basis columns: if the two columns differ in just one row, the 
ferms can be combined. For instance, the pair A-B-C, A-B-C 


1 1 

corresponds to the columns 0, °, which differ only in the third row; 
7 - 2 01 

similarly the pair A-B-C, A-B-@ corresponds to the columns 1, 1 


0’ 0’ 
which differ only in the first row. Based on these concepts, the first 


stage involves three steps. We shall illustrate these steps by following 
through a specific example. 


Jonsider the following designation number in four basic symbols: 


Oo 1 3 4 6-7 9 10 11 13 14 15 
Ptot.t10o0%41 012111 0114 


The first step is to display only the columns of the basis that correspond 
to the units of the given designation number: 
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0 1 3 4 6 7 9 10 11 13 14 15 
0110011011021 
0010110110121 
0001110001121 
0000001112112 


The second step is to arrange these columns in order by the number of 
units contained in each column: 


0-unit l-unit | 2unit | 3-unit 4-unit 
columns | columns | columns | columns columns 
0 10 1010 1110 1 
0 00 1101 1101 1 
0 01 0100 1011 1 
0 00 0011 0111 1 


i i i i ini irs of columns that 
The third step is an iterative process of combining pairs of colu ¥ 


7 0 o ined to form 
‘ : : and 9 are combined to form 9, 
differ only in a single row. For example, 0 9 0 


0 0 

Be 0 0 7 
where the symbol @ means 0 or 1. Similarly 0 and H are combined to 
form ° Note that, since the pairs are to differ in a single row only, 


each sat column need be compared with only (uw + 1)-unit columns, 


To combine with a u-unit column, a (wu + 1)-unit column must have u 
its uw + 1 units in the same rows as the u-unit column |i.e., the u-u 
column must imply the (wu + 1)-unit column]. Thus the 1-unit colum 
are combined with the 2-unit columns as follows: 


eoord 
oroecsy 
CocOorrE, 
Crrog 
HeoOoRS, 
Coro 


ooer 
eocr 
oreo 


When a pair of columns are combined, a check is placed above them 
record the fact. ‘As will be seen shortly, this checking process is of 
utmost importance. In this way the adjacent collections of welt 
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columns and (u + 1)-unit columns are successively compared and 


checked. In our example, this results in 
Vv VV VVVV VVVV Vv 
0 10 1010 1110 1 
0 00 1101 1101 1 
0 O1 0100 1011 1 
0 00 0011 O111 1 
0 | 110 11001100 1116 
00 006 11110011 1161 
0o 001 00110000 1lo1l 
00 000 00001111 @1il 


The process is continued on this and each successive set of combined 
columns. But this time for two combined columns to be further com- 
bined they must have 6’s as well as units in the same rows. The process 
ends when no further combinations can be formed. For our example the 
rest of the work is shown below: 


VV VIN NNN AAA VVSIV 
©0 | 110 | 11001100 | 1110 
00 | 00 | 11110011 | 11061 
oo | oot | o0110000 | 1011 
00 | 000 | oo0e1111 | o111 
1 \ 1 Vo Vi NV o W 
6 NN 1 \ L \ o NN 1 \N 
0X Ni Ne WeN 
Nie Ne ViNViY 


Duplicate columns have been shown hatched above. The prime implicants 
o 


can all now be read from the unchecked columns. 


1 - 
responds to BC’: D, or § corresponds to A -C, and so forth. For our 
o 


example the prime implicants are thus B-C-D, A: B-D, A-C-D, 
A-C, A:B,B-C, A-+D,and B-D. 

The Second Stage—U sing the Prime-implicant Chart. A chart showing 
the prime implicants is used in selecting for inclusion in the simplest sum 
of products those prime implicants which will produce the given designa- 
tion number with the least number of operations - and +. The chart 
is simply a display of the prime-implicant designation-number unit. posi- 
tions. For our example this chart is shown in Table 10-2. (Note that, 
(o determine, for instance, the unit positions of #82» C+D, we simply 


For example, § cor- 
0 
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é 
observe that B- C- D corresponded to ° The in the first row means 
0 
0 1 
that A was eliminated between the two columns 9 and 03 the zeros indi- 


cate that B, C, and D must be complemented in the three positions. 
1 

As another illustration, A -C corresponds to 6; the ®’s indicate that 
o 


B and D have been eliminated, and the zero indicates that C must be 
complemented.) 


TasLE 10-2. PRIME-IMPLICANT CHART 





Unit positions of the given designation number #F 








0134679 10 1 1B M4 15 

B-C-D| 0 1 
A-B-D 
A-c-D 4 6 

A-¢ 9 1 

A-B 3 7 11 15 

B-C 6 7 1415 
—A-D 9 u @ 15 
—B-D 11 1415 





To choose the appropriate prime implicants for inclusion in the si 
plest sum of products, first note that, A - D and B- D must be includ 
because the units of #F in positions 10 and 13 are covered only by B* 
and A- D, respectively. If either B- D or A~- D were left out of 0 
sum, the designation number of the sum, lacking units in positions |} 
or 13, could never be the same as #F. Prime implicants that must f 
this reason be included are called essential prime implicants. Th 
part of our sum must be A: D+ B- D. 

Next note that #(A4- D+ B-D) covers positions 9, 10, 11, 13, 1 
and 15, leaving only positions 0, 1, 3, 4, 6, and 7 to be covered by ad: 
tional prime implicants. The second step in using the prime-impli¢ 
chart is, then, to choose in the best possible way from the nonessen 
prime implicants those combinations whose sum will cover unit po 
tions of #F not already covered by the sum of the essential prime imp 
cants. There is no known systematic procedure for doing this, ex 
for trying all combinations. For our illustration it can be seen th 
A-B-D+A-C+4+B-C will cover positions 0, 1, 3, 4, 6, and 7 in 
best way. (Note that B-C:-D+A-C-D+A-B will also co 
positions 0, 1, 3, 4, 6, and 7, but has one more operation » than the 
choice.) 
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Thus we finally have for the simplest sum-of-products form of the 
designation number 1101 1011 0111 0111 


A-D+B-D+A-B-D+A-C+B-C 


Summary. Summarizing, the process of finding the simplest sum-of- 
products form for a given designation number #/ has two stages. The 
first stage is the determination of all the prime implicants of #F by com- 
bining pairs of basis columns that correspond to the unit positions of #F. 
To aid the formation of the combinations, the columns are first ordered 
into a O-unit collection, 1-unit collection, 2-unit collection, etc. The 
prime implicants are represented by those columns which do not enter 
into any further combinations, i.e., which are left unchecked when all 
combinations have been made. The second stage is to choose in the best 
way those prime implicants which are to be included in the simplest sum 
of products. Asan aid to making this choice the prime-implicant chart is 
constructed, and from this essential prime implicants are noted. Then 
from the nonessential prime implicants a best sum is chosen to cover those 
units of #F not covered by the sum of the essential prime implicants. 
The simplest sum of products then becomes the sum of the essential 
prime implicants and this best sum of the nonessential prime implicants. 

Decimal Method for Simplification.t{ The work of finding the simplest 
sum-of-products form by the method just given can be shortened by 
considering the decimal numbers denoting basis columns instead of the 
columns themselves. Consider again the example of this section, the 
designation number 

01 3 4 6 7 9 10 11 13 14 15 
#F=1 101 1011 034 211 +01 1 21 


Instead of writing the columns in groups of ascending numbers of included 
units, 


0 
1 see 
1 


cooco 
coor 
oroco 
corr 


0 


we merely write the groups of decimal numbers of those columns: 0; 1, 4; 
3, 6, 9, 10; 7, 11, 18, 14; 15. Note that if a column is to combine with 
another in the next adjacent group then the number in the latter group 
must be greater than the number in the former by a power of 2. Hence 
we need list (in groups) for the combined result only the least number, 
and the difference, which corresponds to the power of 2 by which the 
columns differ: 0-1 for columns 0 and 1, 0-4 for columns 0 and 4, 1-2 for 
| and 3, 1-8 for | and 9, 4-2 for 4 and 6 (but not 4-1 for 4 and 3), 3-4 for 
3 and 7, ete, Just as combined columns recombine only if they have 


| This method was developed by James Bruce Wilson, 
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in the same positions, so these pairs combine only if they have common 
differences: thus 1-2 and 9-2 combine, and we write 1-2,8 (also 1-8 and 3-8 
combine to form 1-2,8). The work for this example would be tabulated 
as in Table 10-3. No combinations are possible in the final row of the 
table, because no pairs have the same two differences. 

The prime implicants are represented by the unchecked numbers of 
Table 10-3: namely, 0-1; 0-4; 4-2; 1-2,8; 3-4,8; 6-1,8; 9-2,4; 10-1,4. The 


Tapie 10-3. Work ror THE DEcimMaAL METHOD 


| 1; 413; 6; 5 11; 13; P % 
vv vv v 


0-1; 0-4 | 1-2; 1-8; 4-2 | 3-4; 3-8; 6-1; 6-8; 9-2; 9-4; 10-1; 10-4 | 7-8; 11-4; 13-2; 14-1 
1-2,8; 1-8,2 | 3-4,8; 6-1,8; 6-8,1; 9-2,4; 9-4,2; 10-1,4; 10-4,1 


Vv 
o|1 


vvv 
4 | 3; 6; 


’ 


or, on eliminating duplications, 


1-2,8 | 3-4,8; 6-1,8; 9-2,4; 10-1,4 





prime-implicant chart (see Table 10-2) is formed by adding to each least 
number all combinations of its associated differences. To find the expres= 
sion for one of these combination numbers, we omit variables that cor- 
respond to powers of 2 found in the differences and bar those variables 
whose powers do not add to make the first number, to the left of the dash, 
Thus 9-2,4 corresponds to A - D (since B and C are deleted and 1 + 8 = 9) 
and 4-2 corresponds to A: C'’- D (since B is deleted and0 + 4 +0 = 4). 


10-8. Obtaining Essential Prime Implicants Directly { 


Preliminary Notational Review. Let us recall the meaning - our 
1 1 1 


- O 0 d 0 Si 
i 0; ts both 9 and». Si 
notation. If 0 is column 1 of a basis, 0 represen 0 0 
RTE till 
i O00. 4 If a column 
: he 23 basis columns 9, 0, 1) 0) 1) 0) 1) 1+ 
larly ; represents t > Op Ty O Ly O» By 3 


in p of its rows, it represents 2? basis columns, in which the p TO 


. 0 
are filled in all 2” possible ways. Recall also that. we interpret 0 


1 
mean A: B&B: C, with D eliminated; similarly : means A, with B, C, and 


D eliminated, and so forth. We interpret a basis column as a bina, 
number, with the least significant digit on top, and thus the rows in 


+ The method of this section is based on R. D. Elbourn’s adaptation of a meth 
developed by Harris. (See B. Harris, An Algorithm for Determining Mini 
Representation of a Logic Function, JRE Trans. on Electronic Computers, vol, Ty 
pp. 103-108, June, 1957.) 
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basis column are associated with successive powers of 2; for example, for 
four variables 
AoQ= 
Beo2! 
Co 2? 
Dow= 


ll 
cor Ne 


Hence we shall refer to the power position of a bit in a column (e.g., 
1 41 


column 1, namely, $, has a unit in power position 1, and ° has units in 
0 1 
power positions 1 and 8). Finally, observe that the 2” basis column 


numbers represented by a column with p of its components ® can be found 

by taking the number formed with the ©’s read as zeros and adding 

all combinations of powers of 2 corresponding to the power positions 
1 


of the @’s. For example, § represents basis columns 1 and 1 + 8 = 9, 
C) 


1 
and ¢ represents basis columns 1, 1 +8 = 9, 1+4=5, 1+2 =3, 
o 


14+84+4= 13,14+8+2=11,1+4+2=7, and1+8+4+4+2 
= 15. 

Finding the HKssential Prime Implicants Directly. In essence, in the 
previous simplification method we tried all combinations to obtain 
columns with the most ©’s by a uniform ‘‘building-up” process. These 
columns then represented the prime implicants, and we had to make a 
prime-implicant table in order to choose the essential prime implicants. 
Now, however, we shall approach the problem from a different point of 
view. We focus our attention on a single column at a time (a column cor- 
responding to a unit of the given designation number); if we can show 
that this column is in a single prime implicant, then that prime implicant 
must be essential (in such a case we say that the essential prime implicant 
is based on this column). If we make this examination for each (unit) 
column, then clearly we shall have determined all essential prime impli- 
cants, because an essential prime implicant is a prime implicant that has 
at least one (unit) column contained in no other prime implicant. 

To find whether a column is in a single prime implicant, first find all 
columns that differ from it in only one power position, i.e., by only 
| bit. Suppose that there are p such power positions (that is, p columns). 
Then determine whether or not columns occur that have all possible 
combinations of 0 and 1 in these power positions, that is, 2? columns 
altogether. If so, the original column is in an essential prime implicant. 


Suppose, for example, we are given a column §, The first step is 
0 


(o find all columns which differ from this by only 1 bit, Suppose that 
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1 

11 1 1 

columns °, $, and 6 also occurred (but no others that differed from o by 
1 0 


0 
1 bit). 
1 
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Then—and this is the crux of our whole argument—the column 


will be in a single prime implicant (ie., essential) only if columns 


1 


0 
0 
0 
0 1 h the total combina- 
? 1 and j also occurred, for then we would have the tota 

’ ’ 
1 0 1 


HOR 


’ 


I . . . 
tion written as §. The essential prime implicant in this case would be 


J 1 


1 1 + . 
simply A. On the other hand, suppose, say, that 1 were missing; then 


we have the following reduction by our above simplification method: 


111i 111 1 
tj1 1 ijt 11 11 1j1 1 ia 14 
0*¢0o11 ? 
0/6 9 310 21 then 8 9 $18 $$ fo & then & 6 4; that is, o is im 
bi. 0 o|1 1 0 ¢ 0 0l1 1 @ 0 @ O o 6 


ime implicants. 
ph ecian M ethod of Procedure. A systematic method of procedure is 
as follows: Write out the columns corresponding to units of the designa- 
tion number. For any given column under consideration find another 
column that also occurs differing in 1 bit, say in power Position pi. 
Next find a second column that occurs differing from the original columns 
in 1 bit, say in power position ps; then make sure the other column 
occurs that completes the 2? combinations of 0 and 1 in both these power 
positions, p: and p» Next find a third column that occurs differing 1 
1 bit from the original column, say in power position Ps} make sure 
other columns also occur that complete all 2? combinations of 0 and 
in these three power positions p1, P2, Ps; and so forth. This process wi 
stop either when for a certain number of power positions all combin 
tions do not occur (in which case our original column is in more than 0 
prime implicant) or when no further columns oceur that differ from t | 
original column in only one Sea ae eae (in which case the origin 

is in an essential prime implicant). 
Ca eee can be : abilated: Let us illustrate with the followi 
designation number: 


1 3 4 5 6 7 9 13 14 15 
0101 a11 1 01 01 O11 


The first step is to make the prime-implicant table format with t 
successive power positions to the right: 


Power positions 
DAO ee 
1 4 3.5 6 9 7 ddyvd8old 15 8. 40) aougd 
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We have grouped the column numbers according to the number of units 
in the column. Start, say, with the leftmost column as the original 
given column, and write its binary form (as a row for convenience) under 
the power positions; place an x under the 1 to record that this is the 


column under consideration: 
D C BA 
1 4 3.5 6 9 7 11 #13 «214 15 8 4 2 1 





x 0 001 


Now ask: Does the column occur that differs from the original column in 
power position 8, that is, does column 1 + 8 = 9 occur? It does; so we 


record this by placing an 8 under column number 9 and changing 0001 to 
0001 since both these columns do occur: 





DC BA 
14/3569 |7 WW 13 14 | i |8 4 2 1 
x | 8 | | * 0 0 1 


Next ask: Does the column occur that differs from the original column in 
power position 4, that is, does column 1+ 4 = 5 occur? Yes. Do all 
2? combinations occur? To answer this, note that 9+ 4 = 138, and 
column 13 does occur, so the answer is “‘yes.’’ Thus we can change ¢001 
to 0601, and record the work as follows: 


DC B A 
1 4 3.5 6 9 7 11 13 ~«214 15 8 4 2 1 





*% 4 8 | se | o ¢ 0 1 


Next, try power position 2: column 1 + 2 = 8 occurs; so try 5 + 2 = 7, 
9 +2 = 11, and 13 +2 = 15. All these columns occur, and hence all 


combinations of 0 and 1 occur in the three power positions 2, 4, and 8; 
hence we have 


14 / 3 5 6 9 7 ll 13 «#214 15 








x 24 si|224 a[o #1 


Next try power position 1. Note that, since column 1 has a unit in power 
position 1, the column that differs from column 1 in this position only is 
column 1 — 1 = 0. But column 0 does not occur, and therefore column 1 
is in a single, hence essential, prime implicant, namely, A. 

At this point we are ready to see the main advantage of the present 
method, Since we have found an essential prime implicant that covers 
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columns 1, 3, 5, 9, 7, 11, 13, and 15, we need no longer worry about covering 
them. So our problem is reduced to covering only columns 4, 6, and 14. 
Thus we check off all columns except these and choose column 4 as our 
next given column, 











Vv vv Viv viv Vv D C BA Essential 
1 4 3.5 6 9/7 11 138 #14 15 8 4 2 1 | prime implicant 
x 2 4 8/2 2 4 2 o ® o® 1 A 

x 0 1 0 0 








The process for column 4 is the same. However, note that, in consider- 
ing a power position corresponding to a zero in the original column, we add 
this power of 2 in order to determine what combinations of columns should 
occur, i.e., change the zero to a unit; if the power position corresponds to a 
unit in the original column, we would subtract this power of 2 in order to 
determine what columns should occur, i.e., change the 1 to a 0. The work 
for column 4 becomes 








VV |V VV VIV Vv Ov Vv DC BoA Essential 
14/13 5 69/7 11 138 14 15 8 4 2. 1 | prime implicant 
x 2 4 8};2 2 4 2 o o o 1 A 

x 1 2 1 Oo 1 oe @ D-c 


Here column 4+ 8 = 12 did not occur, nor did column 4 — 4 = 0, 
But column 4 + 2 = 6 did occur, and so did columns 4 + 1 = 5 and 
6-+1=7. Thus we can check columns 4 and 6. Only column 14 
remains unchecked; so we choose this column as the next given column, 
The work appears as follows: 





VV |VVVVIV Vv VO Nv V D C B A Essential 
14/3 5 6 9/7 11 13 14) #15 8 4 2 1 |prime implicant 
x 2 4 8/2 2 4 2 o o® o 1 A 
x 1 2 1 0 at o o D-C 
8 1 x 1 ’ 1 i C:-B 


Column 14 — 8 = 6 occurs; column 14 — 4 = 10 does not occur; column 
14 — 2 = 12 does not occur; columns 14 + 1 = 15, 6 + 1 = 7 do occur, 
Hence C - B is an essential prime implicant. 

Examples and Discussion. Consider the following designation number, 
and let us find its simplest sum-of-products form: 


1 3 5 6 7 8 
01041 011i 100 0 770 Died 
16 17 19 20 21 22 23 26 27 40 


1audinO (Letieleolee bdtet Quilt da one ii 
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The work format appears as follows: 








Vv Vilvvv v Vi[IVv vv Vv Viveov E DCBA Essential 
1 8 16/3 5 6 17 20/7 19 21 22 26/23 27 30/16 8 2 1] prime implicants 
x 2 4 16 2 2 4 2 ¢ 0 @ © 1|/D°A 
x 0 10 0 O|E-D-C:B-A 
x 1 4 1 10 @ 0 | E-D-B 
x 1 16 1 @ 01 1 6/D-c-B 
x @ 4/ 1 1 6 10 
1 x 1101 06 
8 © x} 1 @ 1 1 0 














Note that in adding (or subtracting) to a column number the result 7s always 
in an adjacent group of column numbers (to the right in adding, to the left 
in subtracting). For column 1 we find 1 +16 = 17, which occurs; 
1 + 8 = 9, which does not occur; 1 + 4 = 5,17 + 4 = 21, both of which 
occur; 1 +2 = 38, 5+2=7, 17 +2 = 19, and 21 + 2 = 23, all of 
which occur; but 1 — 1 = 0 does not occur. Thus an essential prime 
implicant has been found, namely, D - A, and column numbers 1, 3, 5, 17, 
7, 19, 21, and 23 need not be used as a given column again. Note that 
nothing combined with column 8. When column number 26 is the given 
column, note that neither 26 — 16 = 10 nor 26 — 8 = 18 occurs; but 
26 + 4 = 30 occurs. Continuing, 26 — 2 = 24 does not occur, but 
26 + 1 = 27 does occur; however, 30 + 1 = 31 does not occur, and no 
essential prime implicant is found—we encircle the 1 to show that it 
did not combine with 30. Similarly, when 27 is the given column num- 
ber, 27 — 1 = 26 occurs, and 27 — 8 = 19 occurs; but 26 — 8 = 18 does 
not occur, and so we have encircled the 8. Three columns are not covered 
by essential prime implicants, 26, 27, and 30; these can be covered by 
combinations of the last three implicants (or those indicated by the 
circled numbers). Hence one result is 


A:-D+A-B-6-D-E+8B-D-E4+B-C-D 
+B-C-D-E+A-B-D-E 


Consider a six-variable problem with units in the following designation- 
number positions: 1, 8, 4, 5, 7, 13, 17, 19, 20, 23, 31, 33, 35, 36, 37, 39, 
44, 45, 49, 51, 52, 53, 55, 60, and 61. The work is shown in Table 10-4, 
in which the columns are grouped appropriately. Here we have intro- 
duced a variation of the present method. For example, starting with 
column number 1 we find that 1 + 32 = 33 occurs; 1 + 16 = 17 and 
33 + 16 = 49 both occur; 1 +8 = 9 does not occur; but although 
1+ 4 = 5 oceurs, 17 +4 = 21 does not occur, and so we encircle 4. 
Now instead of proceeding as above let us see if we can at this point make 
further use of the information so far obtained, namely, 6 ® 0 0 0 1. 
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By our method of procedure 


in the fourth row covers the unchecked columns. 


+ Note that @ 60 11, corresponding to D- B- A, v c 
this is as good as any other product which covers them, and hence we include D- B- A in our simplest sum of products. 
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To do this, suppose that we transfer to column number 17 as our given 
column; we indicate this transfer by putting a slash through the 16 
below column number 17, as is shown. Then @ 6 0 0 0 1 still holds 
for column 17, and we need not try power 32 or power 16 again or, for 
that matter, power 4 (why?). But we must try power 8 again, for we 
have never tried it on this column (why not?). Thus 17 + 8 = 25 does 
not occur; but 17+2= 19, 1+ 2 = 3, 33+2 = 35, 49+2 = 51 
all occur, giving us @ 60001. Finally 17 — 1 = 16 does not occur; 
so we have discovered an essential prime implicant, based on column 
17 rather than on the column we started with, column 1. The next 
two rows of work in Table 10-4 also indicate a transfer to some other 
column number in finding the essential prime implicant. 

Next observe the fourth row of work. Here we meet several further 
circumstances, which we shall explain by means of the illustration. The 
original column number is 7; 7 + 32 = 39 occurs; 7 + 16 = 23 and 
39 + 16 = 55 occur;7 + 8 = 15 does not occur;7 — 4 = 3,23 — 4 = 19, 
39 — 4 = 35, and 55 — 4 = 51 all occur, giving so far 6 @ 0 0 1 1. 
Next observe that 7 — 2 = 5, 3-—2=1, 19-2 = 17, 35 — 2 = 33 
all occur but that 23 — 2 = 21 does not. Transferring to 23 as above, 
we need not try powers 32, 16, 4, or 2 again. Trying 8, we find that 
23 + 8 = 31 occurs but that 3 + 8 = 11 does not occur. We could try 
to transfer once again, completing the crossing off of the 16 below column 
number 23 for the record. But we must transfer to a column with the 
following properties: (1) it is not covered by a previously determined essential 
prime implicant and (2) it does not combine with powers that have already 
been tried and circled. If the transfer were made to a column already 
covered, then this column obviously could not be the basis for another 
essential prime implicant; if the column to which we transferred combined 
with a circled power, then this power clearly would not again allow the 
completion of an essential prime implicant. For our illustration it is 
seen that no further column satisfying ® ® 0 ® 1 1 as well as our con- 
ditions 1 and 2 exists. Thus we must go on to a new row of work, i.e., 
another uncovered column. Note that since we already have determined 
that no column satisfying 6 ® 0 © 1 1 can be the basis of an essential 
prime implicant we should not go on to any of these columns either. For 
our illustration the next given column will be 44, and the rest of the work 
is completed as above. 


10-9. Constraints 


The Problem. Up to now we have been considering systems of inde- 
pendent elementary elements. That is, for elementary elements with the 
basis given above, the truth value of each elementary element is inde- 
pendent of the truth value of any other elementary element. It often 
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happens, however, that this is not the case; the truth value of an ele- 
mentary element may depend on the truth values of the other elementary 
elements for a particular problem. Such a dependence is given in terms 
of a combined element which is to remain true for this problem. For 
example, in a problem we might always have that A— B. The effect 
‘of such a constraint is to limit the possible combinations of truth values 
that A and B can take simultaneously. If A— B is to hold, then B 
cannot be false when A is true; i.e., the combination of truth values A 
true, B false cannot occur. 

Summarizing, a constraint is a logical relation between the elementary 
elements (i.e., a combined element) which is to be considered as tautologically 
true (i.e., always true) because of the stated or intrinsic circumstances of t 
problem. Put another way, constraints are factually true combined 
elements which are to remain true throughout a problem and therefore 
with respect to this problem, can be considered as tautologies. In thi 
section will be given a systematic computational method that auto: 
matically guarantees the maintenance of any desired constraints throug 
out the solution of a problem, so that all advantages produced by t 
constraints can be obtained in a straightforward manner. 

The effect of constraints is to limit the number of input states; i.e, 
some simultaneous combinations of truth values are forbidden by the con 
straints. For example, suppose that, for three variables A, B, and C,n 
two could both be true simultaneously, and it never occurs that all th 
are false simultaneously. These constraints can be written as follo 
A-B=0,A-C=0,B-C=0,andA-B-C =0. 

We have remarked that for n inputs there are 2” columns in the bai 
one column for each of the 2” possible input states. With constrain 
however, some of these 2” states are forbidden. Hence it seems plausi 
that in order to take account of the constraints during a problem 
should simply cross off those columns in the basis which represent f 
bidden states of the particular problem—forming a new, reduced, ¢o 
strained basis. Then we should proceed as described in the previ 
sections, except that all computational methods should be performed 
respect to this constrained basis. This is the method for taking account 
constraints. 

The first problem that arises is: Given a set of constraints as Boole 
functions, which columns of the basis are to be crossed off to form t 
constrained basis? Conversely, if we are given the columns to be cro# 
off or the constrained basis, what is the Boolean form of the constraint 

Determining the Constrained Basis. Let us consider the first of th 
problems, to find the constrained basis, given the constraints. 
that fi(A,B,C, . . .), fl A BiCy + wodyince «lye apB Gye 
straints which are all to be tautologically true simultaneously. 
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the logical product of the designation numbers of the constraints, 


n 

I] #5(A,B,C, ...) =flA,BC,.. fafa +: ie 

jel 
It can be seen that the positions of the zeros in the resulting number 
correspond to input states that can never occur if the constraints are 
all to remain true. Hence the constrained basis is formed by crossing 
off those columns of the usual basis that correspond to the positions of the 
zeros of the product number. 


For example, consider the designation numbers of the constraints of 
the above example, 


#(A-B=0) = 1110 1110 
#(A-C =0) = 1111 1010 
#(B-C =0) = 1111 1100 
#(A-B-C =0) =O111 1111 
Product 0110 1000 

0 38 567 


Crossing off columns 0, 3, 5, 6, and 7 of the usual basis, we find for the 
constrained basis 


#A = 100 
#B = 010 
#C = 001 


Conversely, given a constrained basis, the explicit form of the con- 
straints can be determined as a Boolean function. We shall give this 


procedure by means of an example. Suppose that the constrained basis 
is 

#A =0101 O1 

#B =0011 00 

#C = 0000 11 


The columns of this constrained basis correspond to the columns in posi- 
tions 0, 1, 2, 3, 4, and 5 of the standard basis. Hence we know that the 


designation number of the constraint must have units in these positions, 
01 23 45 


and zeros elsewhere: 11 11 11 00. With respect to the (unconstrained) 
standard basis this is the designation number of B + C, which is the 
desired constraint. Note that B+ C can equivalently be written as 
B+C=T1or B-C =0. In finding a constraint from a constrained 
basis only a single constraint, corresponding to the final product of con- 
straints, will appear, as was the case in our example. 

Designation Numbers with Respect to a Constrained Basis. As stated 
above, when constraints exist in a problem the computational methods for 
thal problem are to be carried out with respect to the constrained basis. 
Hence we must be able (1) to find the designation number of a Boolean 
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function with respect to the constrained basis and conversely (2) to find 
the Boolean function corresponding to a given designation number (in any 
of the various forms considered above) with respect to the constrained 
These computations are identical with those discussed above 
How- 


basis. 
except that all procedures are with respect to the constrained basts. 
ever, for clarity we shall illustrate these procedures. 
For example, consider the following constrained basis: 
#A =0101 O1 


#B=0011 00 
#C = 0000 11 


To find #(A + B-C) with respect to this basis, we have 
#B=0011 00 
#C = 0000 I1 
#(B-C) = 0000 00 
#A =1010 10 
#(4 +B-C) =1010 10 





Note that in this case it is simply # A. 

For the converse problem of finding the Boolean function of a give’ 
designation number with respect to a constrained basis, the designati 
number can be given in two different ways: either (1) with the same nu 
ber of positions as columns of the constrained basis, in which case it 
assumed that the positions of the designation number correspond to t 
columns of the constrained basis, or (2) with 2” positions, for n element 
elements. In this second case the positions of the given designatio 
numbers that correspond to the missing columns of the constrained b 
are crossed off, and the second case is thereby reduced to the first ¢a 
For instance, suppose that it were desired to find the Boolean functi 
corresponding to 1010 1011 with respect to the constrained basis gi 
in the above example. Columns 6 and 7 of the standard basis are mi 
in this constrained basis; hence we cross off positions 6 and 7 of our nu 
ber: 1010 10 & 

The methods given for finding the simplest sum-of-products form ¢ 
easily be adapted to take constraints into consideration. Every com 
strained basis column (i.e., eliminated column) can be used freely if 
combines with nonconstrained columns corresponding to the units of 
designation number. However, positions corresponding to constrai 
columns do not necessarily have to be covered by a prime implicant 
do the unit positions of the constrained designation number). In ot 
words, the constrained positions can be thought of as units or 4 
whichever helps best to simplify the function. 

For example, consider 1111 01 with respect to the constrained 
of the above example, By our first simplification method we have 
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columns 
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possible use 
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0 10 il 0 1 
0 ol 10 1 1 
0 | 0 | o1 14 
VV VVV 
. 1100 | Il Formed by 
a oO1l 10 combination with 
0006 o1 | constrained columns 
4 lo 
t') o1 
0 oo 





The table of prime implicants (omitting columns 6 and 7) becomes 





012 3 5 
C 0 1 2 83 
A 1. .3°8 
B 2 3 


whence C and A are essential prime implicants whose sum covers the unit 
positions of the constrained designation number; thus 


1111 01 = #(A 40) 

The second method given above for finding the simplest sum of prod- 
ucts can likewise be adapted to handle constrained designation numbers. 
Consider again the number 1111 01 as an example: 








C BOA 
0 1 2 3.5 6 7 4 2 1 
x 1 2 0 ® o/ ¢ 
4 2% 2 o o 1 A 
Constrained 
positions 


Note that we had to make use of constrained column 7 in order to form all 
combinations with B (power 2) in the second line of the illustration. 
lhe methods for going from the constrained designation number to 
other Boolean function forms, e.g., the first and second canonical forms 
follow easily. 
Constrained Boolean Algebra, 


An important point arises from the last 
example, 


Note that, with respect to the constrained basis given above, 
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many strange relations hold; for example, 
A=A-(B4+0) A+tB=A-B+B-C4+A-C 


Such relations obviously do not hold with respect to the standard uncon- 
strained basis. In other words, with respect to a constrained basis, we 
obtain a constrained Boolean algebra which is different from the uncon- 
strained Boolean algebra described above. That is, additional relations 
hold in a constrained Boolean algebra that do not hold in the uncon- 
strained algebra—but all relations that hold in the unconstrained alge- 
bra also hold in the constrained algebra. In general these additional 
relations in the constrained algebra often allow Boolean functions 
to be further simplified; that is, A «(B+ C) becomes just A, and 
A-B+B-G@+A-C becomes A + B with respect to the constrained 
basis of the above example. However, it is not always possible to 
simplify a function further; for imstance, A-C+A-C cannot be put 
into simpler form with respect to the constrained basis of the aboy 
example. 


10-10. Logical Dependence and Independence 


Logical Independence. Boolean functions are logically independen 
(with respect to each other) if they can take on all possible combinatio 
of truth values. There are 2” possible combinations of truth values th 
n Boolean functions can have (since each can take either of two tru 
values). Hence, in order to test the independence of n Boolean functio 
we need examine only the combinations of truth values that can ocew 
if all 2” are possible, then they are independent; otherwise they are n 
This can be done by means of the designation numbers for the Boole 
functions. The procedure is illustrated by means of an example. 

Suppose that we desire to know whether or not the two functio 
A+-B+A-Band B are independent. Write their designation numbe 
on successive rows, 

#(A-B+A-B 


) = 1001 
#B 


1100 


Wl 


The columns represent all possible combinations of truth values. Sin 
all four column combinations are there, we can say that A - B+ A: 
and B are independent. 

As another example, consider 4: B+ B:C and A-B+B:C. Wi 
write 


#(A-B+B8-C) = 1101 0001 
#(A-B+B-C) = 1000 1011 
110 0 ' ‘ 1 
Here vor and 0 all appear; so the functions are, indeed, independe 
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Consider next the three functions A-B + A-B,B,andA-B+4-B 
Write 
#(A-B + A-B) = 1001 

_  _ #B = 1100 
#(A-B+ 4-B) = 0110 
Here we need 2* = 8 possible combinations in order that the three func- 
tions be independent. But only four columns appear; hence the functions 
are not independent. 

Logical Dependence. Boolean functions that are not logically inde- 
pendent are said to be logically dependent. This means that there must 
be some logical relations between them that can be expressed as a com- 
bined element. For instance, we know that A-B and A + B are 
logically dependent because clearly A -B-—> (A+B). Given a set of 
n Boolean functions that are logically dependent, the explicit form of this 
dependence can be found by means of the designation numbers, in two 
steps. This first step is to write their designation numbers in snédeaeie 
rows. In this array of designation numbers cross off columns so that no 
two identical columns remain. Since the functions are dependent, there 
will be fewer than 2” columns remaining. Put these columns a their 
standard order, and consider this resulting array as a constrained basis. 
rhe second step is to find the constraint in the same manner as in the 
previous section, remembering that the rows are labeled by the given 
functions (and not_elementary elements). For example, consider the 
functions B-C + A-C,A-C+B-C,and B. Let us denote these by 
fi, fo, and fs, respectively. Then we have 


#h = \ SN 
gcioo \ > \ 
#f= 1100 W 10 A 


and on rearrangement in standard order we obtain the constrained basis: 


012 567 
#f: =010 101 
#f2 = 001 O11 
#f; = 000 111 


Hence the designation number of the constraint is 1110 0111. This 
designation number is to be evaluated with respect to the following 
standard (unconstrained) basis: 


#fi, = 0101 0101 
#fo = 0011 0011 
#f; = 0000 1111 


whence we have 1110 O111 = #[(fi + fs) Ja + (fi + fe) «fal. Hence the 
logical dependence between fi, fo, and fy is given by 


HS be Sa) Sub + fx) fol = #1 
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as can be easily verified by substituting for fi, f2, and f; their correspond- 
ing functions in terms of A, B, and C. 
10-11. Boolean Matrices 


Preliminaries: Boolean Matrix Operations. For our purposes, by a 
Boolean matrix A is meant a rectangular array of elements ij each of 
which can have only the value 0 or 1. Boolean matrix multiplication, 
denoted by ®, is defined in a manner directly analogous to ordinary 
matrix multiplication, except that logical addition and logical multiple 
cation replace ordinary addition and multiplication. Thus we write 


A@B=C — where cs = ) dix: by (10-1) 
k 


O01 010 
0) o (i) - (1 
00 000 


We can define the complement A of a Boolean matrix A by 


A=C ~ where cj = Gi; (10-2) 


101 7 re, 


If + denotes logical addition and - logical multiplication, then we ¢ 
define the logical sum and logical product of two Boolean matrices by 


For example, 


For example, if 


where ci; = aij + bij 
where cj; = Qi; * bi; 


A+BHC 
A:B=C 


Ga 4: aa) = la) 
010 001 011 
cu) can) _ ee 
010 001 000 
The transpose A‘ of A is found by interchanging the rows and colu 
of A; that is, if 


For example, 


At=C then ¢;; = Qji (1 


The relation implies is said to hold between two Boolean matrices 
and B, denoted by A — B, if there are no @ and j such that by = 0 
ai; = 1. For example, 


101\ 111 
. 010 110 
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If both A— B and B— A, that is, if the elements of A and of B are 
identical, then we say that A equals B, denoted by A = B. For some 


useful properties of these operations and relations see Exercises 10-98 to 
10-105. 


Solutions to Boolean Matrix Equations. The following theorem is of 
importance to the development of our transformation theory (see below): 
THEOREM 1. If the Boolean matrix equation 


A@X=B (10-6) 
has a solution X, then 


xX’ =A @B (10-7) 
is a solution, and in fact if X is any other solution, then 
X — X’ (10-8) 
Furthermore, if X’ is not a solution, then there are no solutions and 
A @ xX’ B (10-9) 
Proof.t Equation (10-6) is equivalent to the set of equations 


y ai? Ui, = bix 


J 


(10-10) 


If a solution exists, consider Eq. (10-10) for a fixed value of k and those 7 
for which 6, = 0: then for each 7 such that a:;; = 1, the corresponding 
xj, must be 0; for each 7 such that a;; = 0, the 2, can be 0 or 1. For 
those ¢ for which b;, = 1, there must be at least one value for j for which 
both a; = 1 and a; = 1. By the assumption that a solution exists, 
it will be obtained if we set x= 0 only when necessary, i.e., when 


biz + Gj; = 0 for at least one i, that is, when [] (63 + @;) = 0; otherwise 


v 
tj = 1. Denoting this set of x by the 2;,, we have 
ty = Il (bi + Gj) = > bu * ij 
i 7 


This of course holds for each value of k, whence Eq. (10-7) follows. 
Clearly also X’ has units in at least those positions where any other 
solution X has units, and hence Eq. (10-8) follows. If no solution exists, 
then clearly, by the construction of X’, Eq. (10-9) follows, which com- 
pletes the proof. 

Observe that if solutions exist to Eq. (10-6), then by Eq. (10-8) the 
matrix X’ may be considered as the (unique) least upper bound of the 


| See, for example, R, D, Luce, Note on Boolean Matrix Theory, Proc, Am, Math, 
Soe,, vol, 3, pp, 882-388, 1052, 
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set of all solutions. In general there is no, greatest lower bound to the set 
of all solutions. For example, it can be shown} that the equation 


101 10 
(a oe Cc?) ia 


has five solutions as diagramed in Fig. 10-11, where 


10 
xX’ =([11 
10 


Corouuary la. If the Boolean matrix equation Y ® C = D has a 
solution Y, then Y’ = D @ C‘is a solution, and in fact if Y is any other 





Least upper 
bound 


ee . ee, 
x 


00 
01 Lowest 
solution 


10 


Lowest 
solution 


Fia. 10-11. The five solutions to Eq. (10-11); the arrows indicate implication relations, 


solution, then Y— Y’. Furthermore, if Y’ is not a solution, then there 
are no other solutions and Y’ ® C— D. The proof follows from the 
theorem by the relation (Y @ C)' = C' @ Y*. 

The Unitary Matrix Equation. As will be shown below, the Boolean 
transformation matrix has exactly one umit in each column; such @& 
matrix U is called a unitary matrix. A unitary matrix will perform @ 
kind of permutation of the columns of a matrix M multiplied on the 
right. That is, if M @® U = B, and if uj; = 1, then column j of M will 
appear as column 7 of B. For example, 


1000 O110 0110 
1100 0000) _ [o110 
1110} ®\ 0001} = \ ort 
wu 1000 Wl 


+ A method for systematically generating all the solutions was developed by W, Rt, 
Smith; see R. S. Ledley, ‘Digital Computer and Control Engineering,” pp, 445-447, 
McGraw-Hill Book Company, Inc., New York, 1960; see also Exercise 10-116 below, 
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Since clearly the results of complementing a matrix and then permuting 
its columns and the results of permuting its columns and then comple- 
menting it are the same, then 


M®EU=MeU (10-12) 


Thus the unitary matrix equation 

M@QU=B (10-13) 
can be equivalently written as 

M@EU=B (10-14) 


Solutions to the Unitary Matrix Equation. The characteristics of the 
set of solutions to Eq. (10-13), considering M as the unknown, are of 
importance for our purposes. 

THEOREM 2. The set of solutions of Eq. (10-13), considering M as 
the unknown, has a unique least upper bound given by 


M =BeU (10-15) 
and a unique greatest lower bound given by 
M*=BeU (10-16) 


when solutions exist at all. Every other solution M can be found from 
the relation 
M*— M-— M’ (10-17) 


Proof. Equations (10-15) and (10-16) follow from Corollary 1a applied 
to Eqs. (10-13) and (10-14). Next note that Eq. (10-16) gives the com- 
plement of the least-upper-bound solution to Eq. (10-14). But any solu- 
tion M of Eq. (10-14) is also (upon complementation) a solution of Eq. 
(10-13), and conversely. Hence M* given by Eq. (10-16) must be the 
greatest lower bound of solutions to Eq. (10-13). By the meanings of 
least upper bound and greatest lower bound, Eq. (10-17) follows, com- 
pleting the proof. For example, consider the unitary matrix equation 


0100 
1001 ave 
M ® ={ 1001 (10-18) 
0000 1101 
0010. 
0111 0101 
Here M’ = {0110 and M* = {| 0100 
1110 1100 


and the set of all eight possible solutions is shown in Fig. 10-12, 
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Least upper 
0111 bound 
0110 
1110 


\- 
/ 


0111 0111 0101 
0110 0100 0110 
pooed 


0111 
0100 


0101 
0100 
1100 


1110 


Se 


0101 
0110 
1100 


/ 
= 


0101 Greatest lower 
0100 bound 
1100 


Fia. 10-12. The eight solutions to Eq. (10-18) ; the arrows indicate implication relations. 


It is frequently important to solve a unitary matrix equation for the 
unitary matrix itself as the unknown. In this connection we have the 
following theorem: 

TuroreM 3. The set of unitary matrix solutions to Eq. (10-13), con- 
sidering U as the unknown matrix, consists of those unitary matrices U 
such that 

U— R’: R* (10-19) 
where R'=M'@B and R* =M @B (10-20) 


when such solutions exist at all. 
Proof.t From Theorem 1 applied to Eq. (10-13) we have M @ R’—> B, 
where R’ is given by Eqs. (10-20). If a unitary matrix U is such that 


U— R' then M @ U- B (10-21) 
Similarly from Theorem | and Eq. (10-14) we have M @ R* — B, where 
R* is given by Eqs. (10-20). If a unitary matrix U is such that 
U—>R* thnM@®U-B (10-22) 
Recalling that if P— Q then Q— P, we find from Eq. (10-22), using 
Kq. (10-12), that 
BoM @U (10-23) 
Thus if U implies both R’ and R*, that is, if U is such that Eq. (10-19) 
holds, then from Eqs. (10-21) and (10-23) we have 
M@U=B 
which completes the proof. 


+See R. 8. Ledley, Digital Computational Methods in Symbolic Logic, with Exam 
ples in Biochemistry, Proc. Natl. Acad, Sei., vol. 41, no. 7, pp. 498-511, July, 1955, 
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10-12. Boolean Transformations and the Fundamental Formulas 


Boolean Algebraic Transformations. Consider now, for example, a 
Boolean algebraic transformation from variables f1, fo, fs to variables 
A, Ao, and A3, where 


fh = A.+ A; = 
fo = (Ai + Az) As (10-24) 
fs = Ay: Ao + (A14+ Az): As 


Then any Boolean function F(fi,f2,fs) can be written as a function 
E(A,,A2,A3) in terms of the new variables Ai, A2, A; by means of the 
transformation. For example, 


F => fi “fs + fo (10-25) 
becomes by the above transformation 
K= A,: Ag+ Ai: Ao + As ; (10-26) 


We desire to find a Boolean transformation matrix R which when multi- 
plied by the designation number F of a Boolean function of variables 


fi, .. . fz will result in the designation number F of the transformed 


function of the new variables Ai, ..., Az. That is, we wish to find a 
matrix R such thatt 
FeQR=E (10-27) 


where F is with respect to b[fi, ... fz] and E is with respect to 
b[Ay, . . . ,Ar]. The method for determining F is best described by 
means of an illustration. Consider the transformation described by Eqs. 
(10-24), from which the following array is formed: 


t 0123 4667 


A; = 0101 0101 
A, = 0011 0011 
A; = 0000 1111 
fi = 1111 0011 
: fo = 0111 0000 
fs = 0001 O111 
j «188? 0455 


(10-28) 


iol 


Here b[A1,A2,A;] appears above the line, and the designation numbers of 
fi, fa, and f; with respect to b[A1,A2,A3] appear below the line; the italic 
numbers on top denote the basis columns of b[A1,A2,As3], while those on 
the bottom represent basis columns of b[fi,f2,fs]. Observe that each jth 


{In the remainder of the chapter we drop the sign # and use the letters A, EF, F, 
ote,, alone to represent the designation numbers (or matrices) as well as the functions. 
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column of b[f1,fo,fs] corresponds to some column 7 of b[A1,A2,A3] just 
above; therefore, to find EZ, we must permute the bits of F according to 
this correspondence. The permutation can be accomplished by a unitary 
matrix R which has 7; = 1 if column j corresponds to column 7 in the 
array of Eq. (10-28) or otherwise has r;; = 0. Thus for the above 
example 

t 0123 4567 

70 (0000 1000 

1 {1000 0000 

2 70000 0000 

3 10110 0000 
R= (10-29) 

4 10000 0100 

5 10000 0011 

6 {0000 0000 

7 (0001 0000 


Hence for F as in Eq. (10-25) we find 
(1100 1101) @ R= (1001 1111) 


where the right-hand side is the designation number of FE with respect to 
b[A1,A2,Asl, from which K= Ay : A, + A, . A» + A3. 
Clearly, multiple transformations can be handled in an analogous man- 


ner. For example, if we first transform variables fi, . . . , fs into varia- 
bles gi, . . - , gx by Rf and then transform gi, . . . , gx into i, . - a 
hi by R*, and finally hi, ... , Ax into Ai, ..., Ar by R*, then we 
can write 

FOR QOROSR=EH (10-30) 


which is a generalization of Eq. (10-27). 
Most frequently the transformation does not involve all the variables 
under consideration. For instance, the variables may be fi, ..., 


fx, Xi, . . . , Xx, where only fi, . . . , fy are to be transformed in terms 
of, say, Ay, . . . , Ar; then the transform of F(fi, . . . ,fx,X1, - = Xx) 
will be E(Ay, .. . ,Ar,X3, ... ,Xx). In this case the designation 


number of F (and EZ) can be written in full matrix formf instead of vector 
form; so the columns of the matrix correspond to values of fi, . . «5 
fx, and the rows to values of Xi, ... , Xx. Such a matrix, when sub= 
stituted into Eq. (10-27), will only have its columns permuted by Rj 
that is, only the dependence of F on fi, . . . , fy will be transformed, as 
desired. For example, let F be the function fi: fs* Xi + Jo Xo, and 
find 


+ See R. S. Ledley, Boolean Matrix Equations in Digital Circuit Design, TRE Trana, 
on Electronic Computers, vol. WC-8, no, 2, pp, 131-139, June, 1959. 


= 
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j 0123 4567 


fr: 0101 0101 
fe 0011 0011 


fs 0000 1111 
1100 1100, 0 0/0 
1100 1101 1 O0/;1 
F =| o000 0000} 0 1/2 (10-31) 
0000 0101/ 11/3 
X\X_o 


where we have separated b[fi,fo,fs,X1,X2] into two parts, giving on top 
the corresponding values of f1, fs, fs, and on the side those of X; and X 2. f 
Then, if f1, fs, and fs are transformed as in Eqs. (10-24), we find, using F 
and # as in Eqs. (10-31) and (10-29), respectively, substituting into Eq. 
(10-27), that 

1000 1111 


; _fioor ini) 
F@R=|\ ooo9 ooo] == M1082) 


0001 0011 


which is seen to be EZ = (A, + A;)- Ao* X14 (A,- Ag+ As) + Xe in 
terms of b[ A 1,A 2A 3,X1,X9]. 

The Fundamental Formulas. Summarizing, Eqs. (10-27) and (10-30) 
give a Boolean matrix representation for Boolean algebraic transforma- 
tions. Clearly, however, if we are given the transformation and the 
original function F, the problem of finding the transformed function E is 
rather trivial and can be solved algebraically by direct substitution. 
The importance of Eqs. (10-27) and (10-30) becomes clear, however, 
when the nontrivial problems associated with solving for F or for R are 
considered. These considerations lead us to the next theorem. 


TurorEM 4. If a Boolean function F(fi, ... ,f7,X1, ... ,Xx) is 
related to a Boolean function E(A,, . . . ,A1,Xi, . . . ,Xx) by means of 
the transformation f; = fi(A1, ...,A7),...,f7 =fs(Ai,... ,An, 
then 

F@OR=E (10-33) 


where F and EF represent the respective designation-number matrices for 
the functions F and EF as described above and R is the unitary permuta- 
tion or incidence matrix related to the transformation as described above. 
Furthermore, if R and E are known, then F can be found as satisfying 


F* — F 5 F’ 
where F'’=EF@R and F* = EF @ Rt 


} Note that this is essentially a Veitch chart for F, See E. W. Veitch, A Chart 
Method for Simplifying Truth Functions, Proc, Assoc. Computing Mach,, May 2-3, 
1952, meeting, Pittsburgh, Pa., pp, 127-133, 


(10-34) 






























































































































































































































































482 DATA-PROCESSING TECHNIQUES [Cuap. 10 


If no such F exists, then 


F'@R-E-F*@R (10-35) 
If on the other hand F and E are known, then FR can be found from 
an * 
ee eS ay ae (10-36) 
where R=F@EH and R*=F@E 


If no such R exists (ie., if R’ or R* has a column of all zeros), then 
F ® R’- EF ® R* (10-37) 


Proof. The proof of Eq. (10-33) was discussed above ; Eqs. (10-34), 
(10-35), (10-36), and (10-37) follow directly from applying Theorems 2 
and 3 to Eq. (10-33). 

Equations (10-34) and (10-36) are called the fundam nial formulas, 
because they are the equations most frequently utilized when applying 
Boolean matrices to logical circuit design and other problems. 

Boolean Algebraic Equations. Consider a Boolean equation of the 


general form 


GAy, 6 04 AY Aus 2 ky) = BAe eo ,Ar,X1, .. + ,Xw) (10-38) 
where the unknowns Xi, ... ,Xw are to be found as functions of the 
Ay, ..., Ar so that the equation will hold. From the meaning of 


the ‘solution to an equation,” we desire to find functions fi, ... , 
fy such that, of 


Xi=fi - oe , Xn = fy 


then Eq. (10-38) holds; that is, f functions such that x = fi, . an 
Xy = fw imply that Eq. (10-38) holds. Recasting this in terms of our 
matrix formulation, if we let / represent 


(Xi = fi): (Xo = fe) 0 0 (Xw = fw) (10-39) 


and let E represent Eq. (10-38), then we want F, with the appropriate 
transformation of each f; in terms of the A;, to imply E—that is, 


F@R-E (10-40) 


Here F and E are known, and the determination of R will give the 
desired functions f; in terms of the A;. Thus the matrix R is given by 
all R such that R— R’, where R’ is determined from the left-hand form 
of Eqs. (10-36). 

To find the matrix for F (or EZ) we must first observe that the designite 
tion number for the general form P = Q has a unit only in those positions 
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where both P and Q have units or both P and Q have zeros. Then, for 
example, with respect to b[f1,f:,X1,X2] we have 


1010 1100 1000 

cs! =. _ { 0101 \ { 1100) _ { 0100 
(Xi = fi): (X2 = fr) = i) G = (3) (10-41) 

0101 0011 0001 


A further important observation can be made from this example, namely, 
that the matrix for a function F of the form (10-39) will in general be 
“diagonal,” i.e., will have units only on the diagonal. An immediate 
consequence is that the solution to Eq. (10-40) as given by the left-hand 
form of Eqs. (10-36) reduces to 


R’=E£ 


Thus we have the following corollary: 

Corotiary 4a. To solve a Boolean equation such as Eq. (10-38), 
form the matrix E for the equation, and then the solution fi, . . . , fy 
is simply given by all unitary matrices R such that 


RoE (10-42) 
To solve simultaneous Boolean equations Ey, Hy, ..., Em we find 
unitary matrices R such that 
Rashes eee Ko 


10-13. Application to Biochemical and Intelligence Problems 


Logical Problems. In the examples that follow, the logical translation 
of English will be subordinated to the application of the matrix com- 
putational methods, once the logical transition has been accomplished. 
We shall be interested in the truth value of those sentences about which 
it is meaningful to say that they are trwe or false. Thus A, might mean 
“It was enzyme 1” is true; X, might mean ‘‘There is a dome radar’’ is 
true. The conjunction and corresponds to the logical operation -, or 
(that is, the inclusive or) to +, implies to the relat’on —, and not, or 
negation, to the complementation operation ~. The designation number 
corresponds to a truth table, where 1 means true and 0 means false. 

The majority of logical problems involve given or accepted premises, 
facts, rules, or other logical relationships, which comprise the statement of 
“given functions” of the problems. There are two types of “solutions” 
to a set of given functions, the antecedence solutions and the consequence 
solutions. Antecedence solutions are hypotheses or theories from which 
the given functions can be deduced; consequence solutions can be deduced 
Jrom the given functions, In other words, the truth of the antecedence 
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solutions is sufficient for the truth of the given functions, but the truth 
of the consequence solutions is necessary for the truth of the given func- 
tions. Finding anteeedence and consequence solutions is usually trivial; 
hence most logical problems require antecedence and consequence solu- 
tions of a specified form. Figure 10-13 illustrates this point and relates 
such problems to our Boolean matrix methods by means of the funda- 


mental formulas. 


Antecedence Given Consequence 
solutions equations solutions 
F(f, see SnX1, one Xx) H(A, oe ,ArX1, one XK (fh, see ta,X1, see ,Xx) 
where fi =Sfi(Ay .. - AD) 


In this case In this case 


F@QR-E E-F®@R 
F’=E @R F* =E @R! 
R’'=F QE R*=F @E 


Fig. 10-13. Relationships between antecedence and consequence logical problems, 


Analyzing Experimental Results. In biochemistry an enzyme often 
cannot be isolated easily, and therefore experiments performed with one 
enzyme frequently will involve several others. Consequently a combina 
tion of reactions must be observed. In these cases our logical computite 
tional methods can be extremely useful for evaluating experimental 

_ results and planning future experiments to yield the maximum informae 
tion. Suppose that a chemist is studying enzymes Aj, A», As in relation | 
to reactions X,, Xe, Xs. Suppose that he has done the following four 
experiments: 

E,: A solution containing none of Ai, A:, A; produced reaction X 
but not X, and not X3. 

E»: The solution contained A; and either A» or A3, or both—he could 
not be sure. The reaction was neither X2 nor the combination of Xy 
and X3. 

E3: The solution had A2 but not A;, or else it did not have A» but had 
As. React’ons X; and X, occurred; or else reaction X; occurred, but Xy 
did not. 

E,: The solution was obtained from a source that had A;, and A, or Ay 
or both, or else had neither Ai nor A; the solution turned color, which 
meant that X, did not take place or that both X» and X; did. 


In terms of symbolic logic these experiments can be interpreted thus 
; 





Ey: Ay: Ay: As = Bi Xa Xs 
Es: Ay: (Ar+ A3) = Xe (Xi Xs) 
Es: . + Ay: As = Xi: Xot+ Xi: Xs 


ne 
Ey: Ay: (Ai+ 2) + Ay: Ay = Xi + Xa Xs 
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The problem is: What theories about the enzymes associated with each 
reaction will explain the experimental results? In other words we wish 
to find antecedence solutions of the form 


ee fi(A1,A2,A3)] - [Xe = fo(A1,A2,A3)] - [Xs = fs(A1,A2,A3)] 


T firs $s p 1S to determine the four desi nation-number matrice 
he t te £ Ss for 


0111 1111 1101 0001 
Ol11 1111 1101 0001 
1000 0000 1101 0001 
0111 1111 0010 1110 
BE, = E; = 
0111 1111 0010 1110 
0111 1111 1101 0001 
0111) 1111 0010 1110 
Ol11 1111 0010 1110 
0001 0101 1010 0111 
0001 0101 0101 1000 
1110 1010 1010 0111 
1110 1010 0101 1000 
Es = By, = 
0001 0101 1010 O11] 
1110 1010 0101 1000 
1110 1010 1010 0111 
1110 1010 1010 0111 


Next, b i i 
ne re of the special form of the solution /’, we can utilize Corol- 
0000 0001 
0001 0000 
1000 0000 
0000 1000 
R— E,-E,-E;- E, = 
0000 0100 
0100 0000 
0010 0010 
Le : 0010 0010 
he result is given by 














Pais 
o-oo 
-On 
eee 
fetes 
cocor 
ed 
i) 
— ht et 
eee 
oco 


There are thus four sets of soluti 
Phere ts ‘ ions, one set for e 
Et pn es or each of the four pos- 


X, = fi = Ai: Ay + Ay: As, or Ay: (Ay + As) + Ay: Ag: As, 

or A+ (Ay + Ay) + Ay: As, or Ai: As +A ‘A ‘A 
Xiao 1 2 8 
Xam fem Ay: Ay+ Ay An 
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The first two sets (i.e., using the first two possibilities for X)) are logically 
independent. The last two sets of solutions imply that the reactions are 
not independent but that X2° X;— Xi and that X1-X2°X3= 0, 
respectively. These observations may lead to additional experiments 
for distinguishing between the four possibilities for X1. 

Analyzing Intelligence Reports. Suppose that the Blefuscudians’ 
missile force has three kinds of intercontinental ballistic missile sites, 
which we shall denote by Ai, A2, and A;. Also, let X1 represent their 
Hatlis missile, X» represent their dome radar, and X; represent their Bor 
missile. Suppose that military intelligence has determined that the 
distinguishing features which identify the three types of sites in terms of 
the equipment at the site are as follows: 


Ey: Ay = Xi: Xst+ X2'Xs 
Es: A, = Xo: Xs+Xi-Xs 
Ks: Ag = X1°Xo+ X2° Xs 


These are the given information-function equations F. Now it happens 
that, on a high-altitude V-3 flight over Blefuscudia, observations of a 
certain area showed that there were either Hatlis missiles or Bor missiles 
or both, one could not be sure, and there was no dome radar—i.e., the 
observer reported (Xi + Xs) ° X,. Suppose in addition it is known that 
a missile site of type A» never is in the same area as a site of type Ay 
that is, As—> A2 The problem then is: Assuming at least one missile 
site is present, what types of missile sites are there in the area? In more 
precise form, the problem is: What are the consequence solutions of the 
form F to the given intelligence information-function equation E? 


F: (X, + Xs) ° Xo = f(A1,A2,A3) 


To solve this problem, find 


10 0000 1000 
01 0100 0000 
10 0010 0000 
10 0001 0000 
Fe= E= 
OL 0000 0010 
ol 1000 0000 
10 0000 0001 
10 {0000 0100 


whence from the right-hand form of Eqs. (10-36) we have 


——— (ool 1101 
ba Ot (oe wal) 


Thus “f = (1100 0010) 
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However, at least one type of site must be in the area—that is, A1 + Ax + 
A; is a constraint; similarly A;— A, is treated as a constraint. Now 
the conditions corresponding to the zeros of the designation number of a 
constraint cannot occur. Thus the solution is given by 


(A, + Ao + As) (As Ap): (f) 

= (0111 1111): (1111 1100) - (1100 0010) 
(0100. 0000) 
= A, Ax: As 


In other words, from the observations, together with the given intelligence 
information, it can be deduced that only an A, missile site is in the area. 
10-14. Exercises 


Suppose that A is false, B is true, and C is false; determine the truth value of the 
following combined elements: 


Answer 
10-1. A -B-6 True 
10-2. A-B+A- B True 
10-8. (A-B+A-B)-C False 
10-4. A-C True 
10-5. A+B False 
10-6. (A +B)-C+(A4+C)-6+4-B-C False 


10-7. Note that A — Bis true if both A is false and B is false. Can you think of 
examples of this situation that might occur in real life? 
10-8. Consider the sentence ‘A + Aisatautology.” Is this tautologically true, or 
is it factually true? 

10-9. Using J: A = A and (8) and (15) of Sec. 10-3, prove (18). 

Solution 

A+A-B=(A4+A)-(A+B) =1-(A +B) 

A+B 


as desired. 
10-10. What other equations listed in Sec. 10-3 would be used to prove (19)? 
Solution. (1), 2), (3), (6), (7), (8). [Hinr: Start with the right-hand side of (19) 
and apply (8), using (1), (2), (3), (6), and (7) to simplify the terms.] 


Are the following tautologies? 


Answer 
10-11. 4: Bo A No 
10-12. A> A+B Yes 
10-18. (A— B) = (B= A) Yes 
10-14. (A-B-C— D) = (D-B- C= A) Yes 


10-15. Define / and 0 in terms of p, as given in Sec. 10-3, 
10-16. Show that the following is a tautology: 


pilpr (apr lMel) prCprlurpaleul pal pileywlpilpr (ey), pre) 
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10-17. Show by set diagrams that A + B-C = 


[CuHap. 10 


(A +B)-(A+C). 


If a point is in set A, not in set B, and not in set C, then is it in the following sets, 
i.e., in the sets for which the following combined elements are true? 


Answer 
10-18. (A + B) -¢ Yes 
10-19. A+B-C Yes 
10-20. A-B-C No 
10-21. A- (B+ C) Yes 
10-22. B+ 4-C Yes 
10-23. Draw the set diagram for A > B. 
10-24. Draw the set diagram for A = B. 
Find the truth tables for: ; 
10-25. (A-B) + A. 10-26. A-C. 10-27. A-B-C. 
10-28. A + B. 10-29. (A-B+A-B)-C. 


10-30. Show, by means of truth 
A + A are tautologies. 


tables, that A-B + (A + B)-(A +B) and 


With respect to standard bases determine the following: 





Answer 

10-31. #(A + B) 1101 

10-32. #(A- B) 0100 

10-38. #(A-B+C) 1111 0100 

10-34. #[A-(B 4+ C)] 0101 0100 

10-85. #[A-B+4-B| 1001 

10-36. #{[A- B+ A-B 0110 

10-37. #{[A-B+C-D| 0100 1111 0100 0100 
10-88. #[A-(B + 0) + 4- (B+ 0)| 1101 1011 

10-39. #[A4-B-C +A-B-C} - 1000 0001 

10-40. #{((A +B+C)-(A+B+40)] 0111 1110 

10-41. #[4-B-C+C-(D+B-E)+D-E| 0011 0100 1111 0100 


Find the first and second canonical 
numbers: 


10-42. 1001 0110 
0101 


10-43. 0101 


10-44. 1000 0001 


1111 1111 1111 O1 


form corresponding to the following designation 


Answer 


#[4-B-C+A-B-C+A-B-C 
+ A-B-C| ' 
#[((A+B4+C)-(A+B+4+0C) 
-(A+B+C)-(A+B+C) 
#[4-B-C+A-B-C+A-B-C 
+4-B-C) 
#[((A+B+C)-(A+B+C) 
-(A+B+C)-(A+B+0)] 


#([A-B-C+A-B-C| 

#((A +B Py C):(A+B+4+0C) 
ae +C)-(A+B+4C) 
+ ( +O)-(A+B4O) 





i ea ae | 
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10-45. 0111 1110 #([A-B-C+A-B-C+A-B-G 
+A-B-C+A-B-C+A-B-C 

#(A+B+C)-(A+B+4+6)] 

10-46. 1011 #(4-B+A-B+A-B] 

#[(A + B] 

10-47. 0010 #(A-B) 

#((A +B): (4 + B)- (4 + B)] 

10-48. 1011 0001 0110 0101 #[A-B-C-D+A4-B-é. 
+A-B-C-D+A-B-C-D 
+A-B-C-D+4A4-B-6-D 
+A-B-C-D+4A-B-C-D| 

#[(A+B+C+D)-(A+B+C6+4+D) 
(A+B+C+D)-(A+B+C+D) 
(A+B+C+D)-(A+B+C+D) 
(A+B+4+C+D)-(A+B+4+C+D)| 


Write down directly mongrel forms for: 


10-49. 0110 1001 (A-B+A-B)-C4+(A-B+4-B)-C 

10-60. 1001 0110 0110 1001 (A-B+A-B)-(C-D+6G-D) 
+(A:B+4A-B)-(C-D+6-D) 

10-51. 0010 0100 0010 0010 (A-B)- (6+ D)+(A-B)-(C-D) 

10-52. 1010 0010 0010 0010 A-C-D+A-B 

10-53. O111 1011 1101 1110 (A +B)-C-D+(4+4+B)-C-D 
+(A +8)-C-D+(A+B)-C-D 

10-54. 1011 0111 1011 O111 (A+ B)-(6-#) + (A 4+B)-C-B 


1111 0000 1111 1111 +(C+ D)-E 
Find, by the method of Sec. 10-7, the simplest sum-of-products forms of the following 


designation numbers: 


Solution 


10-55. 0111 0101 
10-56. 0110 1001 
10-57. 0100 0100 1111 
10-58. 0010 0100 0010 
10-59. 0011 1010 1001 


A+B. 
A-B-C 
0100 A- B+ 
0010 
0010 


+A-B-C+A-B-C+A-B-C 
C.D 


1000 1101 0100 1011 


The simplest product-of-sums form is a product of sums that has the least number of 











operations + and - 


ing designation numbers. 


the units.) 


10-60. 1001 
10-61. 1111 


10-62. 0100 
10-63. 0110 
10-64. 0001 


Find the simplest sum-of-products forma of the following designation numbers by 


1010 
0011 


1001 
0101 
0011 


Find the simplest product-of-sums forms of each of the follow- 
(Hinr: Work as in Sec. 10-7, but with the zeros instead of 


0001 


1101 
1001 
O11 


the method of Sec, LOB: 


0110 
1010 
1111 


-(A+B+C)-(44+0¢) 


Solution 

























































































-(B+C+D)-(A+B+D) 


























































































































490 
10-65. 0111 0101. 
10-66. 0110 1001. 
10-67. 0100 0100 
10-68. 0010 0100 
10-69. 0011 1010 


Find the simplest product-of-sums forms of each of the following designation num- 
bers, using the method of Sec. 10-8, but working with the zeros instead of the units: 


10-70. 1001 1010. 
10-71. 0100 1001 
10-72. 1111 0011 
10-73. 0110 0101 
10-74. 0001 0011 
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1111 0100. 
0010 0010. 
1001 0010 1000 1101 0100 


1101 
0001 
1001 
O111 


0110. 
1000. 
1010. 
1111. 


1011. 


Find the constrained basis for the following sets of constraints: 


10-75. A-B =0 


10-76. B-C =0 


10-77. A+ B-C=0 


A:-C=0 


10-78. A> C 
CB 


10-79. A. (B-C +B-C) =0 


A-B>0O 
B+D=1 


Find the constraints corresponding to the following constrained bases: 


10-80. #A = 010 


With respect to the constrained basis given in Sec. 10-9, find the design at) 


#B = 001 
10-81. #A = 111 
#B =010 
#C = 001 
10-82. #A = 0001 
#B=O0111 
#C = 0011 
numbers of: 
10-83. A-B+C 
10-84. A-(B+C) _ 
10-85. A- -C+A- 
10-86. A-(B+C) + 


Answer 
#A = 010 
#B = 001 
#A =0101 
#B = 0011 
#C = 0000 
#A=11 
#B=01 
#C = 00 
#A = 0001 
#B =Ol111 
#C = 0011 
#A = 0100 
#B = 0010 
#C = 0001 
#D = 0000 
Answer 
A+B 
A-B+A-C 
A-C+B-C 


Answer 


1111 Ol 
0101 Ol 
1000 00 
1101 (10 
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01 
00 
11 


000 
111 
101 
O11 


ll 
~ 





Src. 10-14] 


With respect to the constrained basis given in Sec. 10-9, find the simplest sum-of- 
products forms of: 


10-87. 0101 
10-88. 0111 
10-89. 1010 
10-90. 0101 
10-91. 0111 
10-92. 1010 


Are the following sets of functions independent? If not, find the logical dependence. 


10-93. 


10-94. 


10-95. 


Prove the following useful properties of Boolean matrices: 
10-98. (A @B) @C=A@Q(BOC)/=ABOBOLC. 
10-99. (A+B) +C=A+4+(B4+C)=A+B+0C. 
10-100. (A-B)-C=A-(B-C)=A-B-C. 

10-101. (A @ B)t = Bt @ At. | 
10-102. If A— Cand B— D, then A ® BoC @ D. ‘Ih 
10-103. A’ = 


10-104. A-B=A+B,A+B=A-B. 


10-105. If A 


Find the Boolean transformations that will transform F into E: 


10-106. F = 


E= 


Solution, ti - Ay + Ags Ay + Aa (Ay Aa + Ai Aa), fo = Ave Aas Ay + Ai- Aa, 


f= Ay: Ay, 
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Answer 
01 A 
01 A+B |W 
01 A-C+A-C 
0100 A 
0101 A+B 
0101 A-GC+A-C 


Answer | 
+A-B No | 
+A-B 

Yes 

Yes 

-B)-C+(A-B+A-B)-C 
+A4-C Yes 
TBC 
B)-C+A-B-C 
+A-C No 
+B-C 


At. | 





— Bthen B= A. 

















(fi + fs) - Xi: Xo Rs + fie fo: Xi Xo- Xs + fo + fs): Ki- Xo Xs 
+ fie X1+Xo- Ret fifo: Xi Xe Xa thie fe: Xi- Xo Xs 

+ fo: Ki -X2-Xs + (fie fo tfe-fi) » X1-X2- Xs. 

((Ay + As) + As + As (Ai+ As + Ai: Av)» Xi» Xa- Xs 

+ (Ay- As + Ar: A+ As) -X1- X2- Xs 

+ ((Ai + As) + As + As- Ai) - Xi -X2- Xs 

+ (A,- Ay: As + As: (Ar» Ap + A+ An) /X1-X2- Xs 

+ (Ag> As + Ar: Ag+ Ay) +R.» Xo Xs + Ar: Az As: X1- Xe Xs 
+ (A, + Ag: Ay + Ay: As) + Xi + Xa+X3 + Ags Xi Xa- Xs. 
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-107. F = X,-Xo-f, + Xi + Xo- fs + X1- Xo - fo + Xi-X2- fo + hi: fs) 
cai Pek Lok PaaS + A;-(Ai- As +_Ai- A:)) 
+X,-X.- A; + X1+Xo- (Ag+ (Ai + Av) + As: (Ar + A2)) 
+ X1-X2-(As-(Ar- Aa + Ar: As) + Ar A2- As), 
10-108. F = X,- Xo: (fo- fs +fi- fe fs) + X1-Xo- (fifo fs + fs fh + fo)) 
+ X.-Xo- (fi fee fa + fe fs) + X1-Xo- (ff fs + hi: fe). 
E = X,-X,-(A,- Ay As + Av As) | = 
+X,-X2-(A1- Ap: As + As+ (A1- Ap + A> A:)) 
+ ®.-X2- (As: (Ai- Ay + Ar: A2) + As+ A2> Ar) 
+ X,-X.-(As- Ag+ Ai + As: (Ai- As + Ai: A2)). 


Solve the following sets of simultaneous Boolean equations for the unknowns 
X and Y: 


10-109. (A +B)-X+A-X-V=A,4-¥+X=A4+B,X-V>B. 
Solution. X= A,Y=A-B+A-B;X =A, Y =A+4+B. a 
10-110. X-Y=A-B,A+B=A-¥+4+X,(A+B)-X+A4-Y =A, 
Solution. X =A,Y =A-B+A-B. 

it ¢-Veo deh Lie oe hoy ae A ee ee, 


No solution. Why? 
10-112. ¥-(A +B) +Y-(A +40) 
X-(A-B+A 


There are four sets of solutions: 


X=A-€+(A+4+8)-C 
Y=C+A-B+A-B 


X =(4+B)-C€+(A+B8)-C 
Y=C+A-B+4+A-B 


ll 


ll 


= (A +B)-C.+(A +.B)-C 
C+ (44+ 8)-C 


ht bd te 


10-113. Solve the following set of simultancous implication equations: 
1. (B+A4+D+C)-YoX 

2A-B4+4-B>X-Y — _ 
3. X +Y3A-C4+A4-B-C-D 


4. 
5. A-B-C-YOX ee us 
There are three pairs of solutions to this set of five implications: 


‘i /) = Po; = As Bae 
A-C-(B+D) X=A-B-C - xX A 
C-(A-D+B-D) Y=C-(A+B8B.-D) Y=C-(A-D+B-D) 


x 
Y 


10-114. Find fi = fi(Ai, 42), fe = fo(A1,42), where 


h, = Ao + Az gn =hichs f=zHan tg 
hs A, + Az g2 =h-h fo =g2 +9 
gs =h-h. 


ll 


10-115. Find fi = f1(91,92,93), fe = f2(g1,92,gs) so that the following relations will he 
satisfied: 
hy = Ap + Az gi = hi-ho 
ho = Ai + Az g2 = hy-hs 
gs =h- he 
+ fo) + X1- Xo + (fi + fe) -X,- Xo + (fiefe thi + fa) Xi Re 
2: A; + Ai- Ay-X1) > X2 + (Ar + An + As) Ri Xe 


‘ 


= 
( 


pap oh 


F 
E= 
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10-116. All Solutions to the Boolean Matrix Equation. We introduce a new kind of 
matrix-multiplication operation, called the theta product, defined as follows: (S;;) = 
(P3x) © (Qz:) if and only if for each position 


Si: =|] (Pix = Qui) 
k 
= [] (Pin: Qe: + Bye Oe) 
k 


Show that given the matrix equation 
(diz) ® (taj) = (b:;) 
—wherei =0,1,...,/;7 =0,1,... ,J;andk =0,1,..., K—if 
(Sj) = V(diz) @ (b[Xy,X1, 2. Xx) }¢ 0 (b:;) 


then when (S,,;) is interpreted as an R-type matrix, all solutions to the matrix equa- 
tion are found. 
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CHAPTER 11 


SEARCHING, SORTING, ORDERING, AND CODIFYING 


11-1. Introduction 


‘Ingredients and Definitions. The ingredients of searching are items 
and characteristics and the associations of characteristics with items. An 
item is a package of information; the characteristics associated with this 
item are attributes that are related to the item. Problems of searching 
all involve locating an item, or items, associated with a given set of char- 
acteristics. For example, each article reference included in a bibliogra- 
phy is an item; the subjects relating to the contents of the article are the 
characteristics. A bill of sale is an item; the name and address of the 
buyer may be considered as the characteristics. A research grant is an 
item; the name of the principal investigator, the amount of the grant, 
the subject under investigation, the name of the school or organization 
receiving the grant, etc., are all characteristics. The inventories of 
particular replacement parts are the items; the serial numbers, costs, 
acquisition record, etc., are the characteristics—and so forth. 

Given a set of characteristics, the process of obtaining every ztem associ- 
ated with all the given characteristics is called searching. In a bibliographi- 
cal list of research reports one might search for all articles concerning the 
application of nuclear theory.t The words application, nuclear, and 
theory are the characteristics, and the search should result in a collection 
of references, each concerned with the application of nuclear theory. 
As a specific illustration, consider the ztem list of article references and 
the characteristic list of subjects of a sample bibliography given in Fig. 
11-1. We have given each item a boldface number and each character- 
istic an italicized number. The ztem-characteristic associations are given 
in Table 11-1, where a unit indicates that the characteristic of the row 
of the unit is associated with the item of its column. Consider the rows 


t In this chapter we neglect any theory of semantics or meaning associated with the 
characteristics. For this important problem see, for example, Y. Bar-Hillel, Some 
Theoretical Aspects of the Mechanization of Literature Searching, Tech. Rept. 3, 
ONR, Contract No, N62558-2214, April, 1960; and A Logician’s Reaction to Recent 
Theorizing on Information Search Systems, Am, Document., vol. 8, no. 2, pp. 
118-118, April, 1957 (with comments, pp, 103122), 
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Article 

No. 

1.1. Abrahams, A. P. 
Autoradiographic 
determination of 
radioactivity in rocks. 
Nucleonics 15:85-86 
Mar 1957 

1.2 Aravindakshan,C. A 
simple arrangement 
for obtaining optical 
transforms of crystal 
structures. J. Scz. 
Instr. 34:250 Jn 1957 

1.3 Gasstrom, R. V. A 
-very fast pulse-height 
analyzer with inde- 
pendent uptake, sort- 
ing, and storage of in- 
formation. Nuclear 
Instruments 1:75-79 
Mar 1957 

1.4 NBS Circular 850. 
Bibliography on igni- 
tion and spark ignition 
systems. Nov 1 1956 

Word No. 

adaptability 3.2 

analysis 4.1 

application 5.1 

concept 6.2 

counting 2.1 

design Vere | 

differentiation 3.1 

England 5.2 


Fia. 11-1. Illustrative bibliography example. 


Article 


No. 
2.1 


2.2 


2.3 


2.4 


Nicholls, J. Alpha- 
scintillation monitor 
for hands and cloth- 
ing. Nucleonics 
15:80, 81, 83, 84 Mar 
1957 


Pope, M.I. An 
automatically record- 
ing vacuum balance. 
J. Sci. Instr. 34: 229- 
232 Jn 1957 


Powell, D. A. An 
apparatus giving 
thermogravimetric 
and differential ther- 
mal curves simul- 
taneously from one 
sample. J. Sci. ° 
Instr. 34:225-227 Jn 
1957 


Seidle, F. G. P., et al. 
Modification of the 
Brookhaven fast 
chopper. Nuclear 
Instruments 1:92-93 
Mar 1957 


Word No. 
evaluation 6.1 
gas 2.2 
heat 2.8 
hysteresis be § 
implementation 5.3 
instrumentation 45.3 
mass 1.2 
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No. 
3.1 


3.2. 


3.3 


3.4 


Word No, 
Netherlands 4.2 
nuclear 4.8 
plan 6.2 
theory 4.4 
thermal 2.8 
use 1 
versatility 8.2 
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Senior, D. A. The 
Kerr cell, a high 
speed electro-shutter, 
Pt. Il. Instr. Pract. 
11:471-476 May 
1957 


Smith, B. O., and 
Grimshaw, A. G. A 
pneumatic level indi; 


cator. Instr. Pract. 
11:469-470 May 
1957 


Stockendal, R., and — 
Bergkvist, K. E. - ; 
Evaporation device . 
for beta-spectrometer 
samples. Nuclear 
Instruments 1:53-54 
Jan 1957 


Tove, Per-Arne. 
Electronic time ana= — 
lyzer applied to the 
measurement of the 
half-lives of meta- 
stable nuclear states. | 
Nuclear Instruments 
1:95-100 Mar. 1957 
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associated with the given characteristics. Form a new row that has a 
unit corresponding only to those (column) positions where all the given 
characteristic rows also have units, ie., logically multiply these rows. 


TaBLeE 11-1, AssocrATION oF ITEMS WITH CHARACTERISTICS 














| Item 
Characteristic - 
| 112 12 13 14 21 22 23 24 3.1 32 33 3.4 
17 0 0 0 0 0 0 0 0 0 0 0 1 
1.2 0 0 0 0 0 1 0 0 0 0 0 0 
2.1 1 0 0 0 0 0 0 1 0 0 0 0 
2.2 0 0 0 1 0 1 0 0 0 0 0 0 
2.3 0 0 0 1 0 0 1 0 0 0 0 0 
3.1 0 0 0 0 0 1 1 0) 0 0 0 1 
3.2 1 0 0 0 0 0 1 0 0 0 0 1 
4.1 0 0 1 0 0 1 0 0 0 0 1 1 
4.2 0 0 1 0 0 0 0 1 0 0 1 1 
4.3 0 0 0 0 1 0 0 1 0 0 1 1 
4.4 0 0 1 0 0 0 0 1 0 0 1 1 
5.1 0 0 0 0 0 1 1 1 0 1 0 ] 
§.2 0 1 0 0 0 1 1 0 1 1 0 0 
§.3 0 0 0 1 0 0 1 1 1 0 1 0 
6.1 1 1 1 0 0 1 0 0 1 0 1 0 
6.2 0 i 0 0 1 0 0 1 1 1 af 0 
7.1 0 1 0 1 1 1 1 0 1 1 0 0 
en ae 


The units of the resulting row correspond to the columns of the desired 
utems. For example, given application (5.1), nuclear (4.3), theory (4.4), 
we logically multiply rows, 





5.1 0000 0111 0101 
4.8 0000 1001 0011 
44 0010 0001 0011 
obtaining 0000 0001 0001 


with units in columns 3.4 and 2.4 corresponding to items by Tove and 
Seidle. However, it is rarely feasible to display a matrix such as in 
Table 11-1; we have used it simply as ‘an illustration to aid the visuali- 
zation of the searching methods to be described. 

It is difficult to discuss searching without using the terms sorting, 
ordering, and codifying, It is often helpful for temporary use to sub- 
stitute a symbolic abbreviation to represent an item or characteristic. 
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For example, the use of boldface.and italicized numbers in our illustrative 
bibliography made the table of associations easier to display. A code 
is a list of such abbreviations; the making of a code is codifying.t Fre- 
quently the code is given in the form of a “dictionary,” or “thesaurus.” 
In our illustrative bibliography the code for the item (article) list is pre- 
sented in dictionary form, and the code numbers are listed in numerical 
order. The characteristic (word) list is in thesaurus form, and the word 
list is in alphabetical order; here synonyms are given the same code num- 
ber. Codes are used in searching as follows: First the thesaurus of 
characteristics is compared with the list of desired characteristics, and 
the corresponding code numbers are found; next the search is made to 
find the code numbers of the resulting items; finally the actual items are 
determined from the dictionary. In computer searching the use of codes 
in this manner is almost always indicated. Further discussion of codes 
appears in Sec. 11-7. 

By ordering a list is meant the process of arranging the list according 
to some linear ordering system, such as alphabetically or numerically, 
so that, if element Q of the list is greater than element P, then element Q 
lies further from the beginning of the list than P. By sorting a list is 
meant the process of separating the list into partitions, i.e., mutually 
exclusive classes. In sorting items the equivalence relation by means of 
which the classes, or partitions, are defined is often given in terms of 
associated characteristics. For example, research grants may be sorted 
according to the university that received the grant, etc. Searching may 
be defined as the process of sorting a list of items into two categories, 
those items which are associated at least with all of certain given char- 
acteristics, and those which are not. Frequently the classes, or par. 
titions, in a sort are ordered. For example, collections of research 
grants sorted by university can be ordered alphabetically by the uni- 
versity’s names. 

Searching and sorting are both closely associated with ordering. For 
instance, to sort research grants by university, all that need be done ig 
to order the grants alphabetically by university, and then the result 
will present all grants from the same university juxtaposed in the list; 
hence the grants will automatically be sorted as the partitions of the sort 
are ordered. 

Summarizing, to sort a list of elements, some equivalence relation 
must be defined that tells when two elements are in the same equivalence 
class or not. To order a list linearly, an ordering relation must be 
defined. Sorting can often be accomplished by means of ordering, when 
the ordering relation is so chosen that the identities of some type of chat. 


+ We use the term codifying here in preference to coding to distinguish the proces 
being discussed in this chapter from the programming of a computer. 
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acteristic represent the desired equivalence relation. As we shall see 
in the next section, searching is often carried out by sorting ordered lists. 

Generalized Concept of the Searching Problem in Terms of Boolean 
Algebra. The searching problem can be easily couched in terms of 
Boolean algebra. This is accomplished by observing that an item is 
uniquely defined (at least as far as the search is concerned) by the pat- 
tern of characteristics that are and are not associated with it. With 
three characteristics A1, Az, and A; a total of eight items Xo, Xi, Xo, 
X3, X4, X5, X6, and X; can be distinguished, 


Ba 
ma 
mr 


Xo = 
X= 


3 X, = A,- Az: A; X2= 
3 Xs; = Ai: Aa: A; Xe = 


1: Ay: A; X3 = A,: Az: A; 
1: 42: Ag X; = A,: Az-+ Az 


1° 


1: Ay: 


2° 


> BB, 


mi 
ma 


Here, for example, X; = Ai: A,- As means that item X; is associated 
with characteristic A; and A; but not with A». If the basic symbols are 
chosen to be the characteristics, then the columns of the basis can repre- 
sent items distinguishable by these characteristics, where a unit in a row 
of a column indicates that the item of that column is associated with the 
characteristic of that row and a zero means that it is not. Thus 


Xo Xi X2 Xs Xs Xs Xe Xz 
#A,=0 10 1 0101 


#A.=0 01 1 0011 
#A;=0 0 0 0 1111 


The simple searching problem is then solved as follows: To find all items 
associated with A,- Ae, simply form 


Xs Xr 
#Ai,-A2.=0 0 0 1 000 1 


whence X; and X; are the desired items. Of course there is not always 
an item associated with every combination of characteristics; in such a 
case the columns for which there is no associated item are omitted from 
the basis, i.e., a constrained basis is used. Thus Table 11-1 can be con- 
sidered as a constrained basis. 

With this interpretation the searching problem can be generalized as 
follows: Suppose that it is desired to find all items associated with 
characteristic A, and at least one of the characteristics A» or As, that is, 
associated with A,- (A, + A;3). We simply form 


whence the answer is Xy, Xs, and X;. As another illustration, consider 
Table 11-1, and find all articles associated with nuclear theory or nuclear 
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plans or both. Observing that 


Nuclear = 4.3 = 0000 1001 OO11 
Theory = 4.4 = 0010 0001 O011 
Plans = 6.2 = 0100 1001 1110 


we have (4.8) - [(4.4) + (6.2)] = 0000 1001 O01L 




































Hence articles 2.1, 2.4, 3.3, and 3.4 by Nicholls, Seidle, Stockendal, and 
Tove are the final desired results. Summarizing, when it is desired to find 
all items specified by a Boolean function of the characteristics, the designation 
number of the function is formed; the positions of the units indicate the 
desired items. 


11-2. Methods of Searching 


The First Two Methods. For concreteness we assume in what follows 
that the items and characteristics are listed on magnetic tape. Following. 
this assumption, we shall describe three methods for searching with a com= 
puter, using our illustrative bibliography as an example. Each of the 
searching techniques is, as will be shown, intimately associated with the 
listing arrangement of items and characteristics on the magnetic tape; 
this is because the associations are indicated by means of the item-character- 
istic arrangement on the tape. 

First, consider the case where all characteristics associated with each 
item of the item list appear under that item (see Figs. 11-2a and 11-3a), 
Given a set of desired characteristics, the computer reads an item and 
its associated characteristics from the magnetic tape into the high-speed 
memory and compares to determine whether or not the given character» 
istics are included in the characteristics associated with the item; if 
so, the item would be suitably recorded. This process is accomplished 
for each item and its associated characteristics on the tape, one item 
at a time successively. In this way, after a single pass through the 
tape, every item associated with all the given characteristics is retrieved, 
The advantage of this system is that the items need not be ordered on the 
tape, since every item is searched. The disadvantage is that every sét= 
tion of the magnetic tape must be searched in detail, consuming much 
time in the computer. 

Second, consider the case where the characteristic list is recorded on 
the tape and under each characteristic is a list of items associated with 
that characteristic (see Figs. 11-2b and 11-3b). The characteristics must 
appear in some order on the tape—e.g., alphabetically or numerically 
Given a set of characteristics, the computer would first order them @ 
then would go through the magnetic tape looking for the first, or small 
characteristic; having found this, it would read the characteristic and 
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(a) (d) (c) 


Fie. 11-2. Methods of listing on magnetic tape. 










1.2 
5.2, 6.1, 6.2, 7.1 





3.2 















1.1, 2.3, 3.4 2.4 | 5.1, 5.3, 6.2 


1.3 41 3.4 | 5.7 

4.1, 4.2, 4.4,6.1 1.3, 2.2, 3.3, 3.4 3.3 | 5.3, 6.1, 6.2 
13 | 62 

1.4 4.2 

2.2, 2.3, 5.3, 7.1 1.3, 2.4, 3.3, 3.4 5.1 


2.3 | 5.2, 5.3, 7.1 
2.2 | 5.2, 6.1, 7.1 
3.2 | 5.2, 6.2, 7.1 


2.1 
4.3, 6.2, 7.1 


4.3 
2.1, 2.4, 3.3, 3.4 












2.2 4.4 2.4 | 5.3, 6.2 
1.2, 2.2, 3.1, 4.1, 1.3, 2.4, 3.3, 3.4 3.4 

5.1, 5.2, 6.1, 7.1 

2.3 5.1 


2.3, 3.1, 3.2, 5.1, 
6.2, 5.3, 7.1 


2.2, 2.3, 2.4, 3.2, 3.4 








(a) (0) (c) 


Via, 11-3, Sample segments in the lists for each of three methods. 
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associated items into the high-speed memory and record all items associ- 
ated with it. Continuing the search on the tape, the computer would 
look for the second, or next smallest, characteristic. Having found that, 
the computer would see which items associated with the second character- 
istic were also associated with the first and record them. Then the 
computer would look for the third characteristic and record only those 
items associated with it which were also associated with both the first 
two characteristics, and so on. The computer need not examine the 
whole tape in detail, but only those sections associated with each of the 
given characteristics; however, both the list of characteristics and the set 
of given characteristics must be preordered. 

Note that, in terms of our association matrix of Table 11-1, the first 
and second methods described above correspond, respectively, to record- 
ing the successive columns of the matrix and the successive rows of the 
matrix on the magnetic tape. 

The Tabledex Method. Finally, consider a third method for recording 
the items and characteristics on the magnetic tape. The method involves 
the recording of successive tables on the magnetic tape and is therefore 
called the Tabledex method. There is one table for each characteristic, 
A table is formed as follows: All items associated with this characteristic 
are listed in the table; adjacent to each item are sublisted all those char- 
acteristics greater in order than the characteristic of the table which are 
associated with that item (see Figs. 11-2c and 11-3c). For example, in 
Fig. 11-3c, consider the Tabledex table headed 4.4. Observe from Table 
11-1 that the characteristic 4.4 is associated with items 1.3, 2.4, 3.3, and 
3.4. Now observe that item 2.4 is associated with 2.1, 4.2, 4.3, 4.4, 1, 
5.3, and 6.2, but only 5.1, 6.3, and 6.2 appear in the sublist, for only 
these are greater than 4.4, the number of the table (see Fig. 11-3c). Simi 
larly the sublists for 3.4, 3.8, and 1.3 were formed for this table. (Note 
that these sublists are ordered in the table on their first characteristics.) 

The tables are listed in the order of their associated characteristic, 
and the items and the sublist of characteristics for each item are also 
listed in the proper order. Given a set of characteristics, the computer 
first puts them in the proper order, then searches the tape for the table 
associated with the lowest ordered characteristic, reading this into the 
high-speed memory, and finally searches the sublists, recording those 
items associated with a sublist containing all the other given characters 
istics For example, if the three given characteristics are 7.1, 5.1, and 
5.2, these would first be written in order as 4.1, 5.2, and 7.1; then the 
table for 5.1 would be searched; and finally, if a sublist were found to 
contain both 5.2 and 7.1, its item would be recorded. As can be seen in 
- Fig. 11-3c, the items whose sublists contained 5.2 and 7./ are 2.3, 2.2, and 
3.2; hence these are all items each of which is associated with all the give 
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characteristics. Using the Tabledex method, only the single section of 
the tape that contains the appropriate table need be searched in detail. 

Population vs. Substantive Ordering of the Lists. The reader may have 
wondered how we arrived at the code number for the characteristics 
and items given in Fig. 11-1. The choice of numbering is related to the 
concepts of population-based ordering and substantive-based ordering. 
Substantive ordering refers to an ordering arrangement based on some 
attribute of the individual terms in the list. Thus an alphabetical order- 
ing of items by author is a substantive-based ordering, an ordering of 
characteristics by research-field classification is substantive-based, etc. 
In Fig. 11-1 the item-list code numbers were assigned substantively to the 
articles, in alphabetical order; in addition the number of an item locates 
its column and line, e.g., item 2.3 is in column 2, line 3. 


1.1 
(a) 3.4 B81 8.2 4.1 4.2 438 44 81 


Fl 
1.2 
14 
(b) 2.1 
2.2 
2.3 
3.1 
3.2 








Fic. 11-4. The first and last Tabledex tables for the bibliography of Table 11-1. 


Population-based ordering is quite different: it refers to an ordering 
based on the quantities of ttems or characteristics in the associations. For 
example, population-based code numbers for each characteristic might be 
assigned as the number of items associated with it. To distinguish 
among characteristics that have the same number of associated items, a 
decimal number can be appended. ‘The characteristics of our example 
are numbered in this way. Thus in Fig. 11-1 it is seen that analysis, 
Netherlands, nuclear, and theory each have four associated items; hence 
their population-based numbers are 4.1, 4.2, 4.3, and 4.4, respectively. 
Ordering the characteristics by population-based code numbers can save 
searching time. In the second method given above, it is most efficient 
to begin searching in the section of the smallest given characteristic 
when population-based numbers are used. (Why?) Similarly, in the 
Tabledex method when population-based numbers are used, tables that 
include many items have short sublists, and tables that have long sub- 
lists include few items. (Why?) For example, Fig. 11-4a illustrates the 
first and Vig. 11-4b the last Tabledex table for the bibliography of Table 
11-1, for which the characteristics code numbers are population-based. 
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11-3. Manual Searching Methods 


For each of the three searching methods given above there is an 
analogous manual searching method. Besides being of considerable 
importance in themselves, these manual methods can serve as a con- 
crete visualization of the more abstract operations that take place in a 
computer. 

Marginal-punch Cards. The first method given above corresponds to 
the use of marginal-punch cards, where each card represents an item and a 
marginal position on a card represents a characteristic. Different cards 
represent different items, but the same respective marginal position repre- 
sents the same characteristic on each card. The association of character- 
istics with an item is made on its 
ecard by punching a notch in the 
marginal position of those character- 
istics which correspond to the par- 
ticular item of the card. One type 
of card commonly used has holes 
corresponding to each marginal posi- 
tion. To sort a deck of the cards 
according to a particular character- 
istic, a needle is placed through the 
holes in the cards corresponding to 
this characteristic; then, when the 
needle is lifted, those cards which 
have been punched in this position 
will fall, the other cards will be lifted 
by the needle, and the deck is 
thereby separated into two parts 
(see Fig. 11-5). A search is pers 
formed by repeating this process 
successively on the dropped cards (which have the desired characteristi¢) 
until those cards punched in at least all the desired characteristic positions 
are obtained. Note that the initial deck of cards need not be kept in any 
particular order whatsoever. 

Peek-a-boo Cards. The second method given above corresponds t6 
the use of the so-called ‘‘Peek-a-boo cards.” Here each card corresponds 
to a characteristic; the surface of a card is laid off in an XY coordinate 
system with each particular pair of coordinates representing an item, 
Different cards represent different characteristics, but the same respective 
XY coordinates represent the same item on each card. The associatio 
of characteristics with items is made by punching a hole at the X¥ 
coordinate positions that represent each item associated with the cha 


Punched notch 
representing an 
association 





Fie. 11-5. Marginal-punch cards. 
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teristic of the card. A search is made by first selecting from a deck of 
cards those cards which correspond to the given desired characteristics. 
The selected cards are placed together to form a pack; if an item is associ- 
ated with all the given characteristics, then there will be a hole passing 
completely through the pack at the coordinate point corresponding to 
that item (see Fig. 11-6). Note that the initial deck of cards must be 
kept in some definite order so that the cards can be easily selected for a 
particular search, and note that the pack of selected cards must be 
carefully aligned. 

Programming a Computer to Generate a Manual Tabledex Collection. 
The third method given above corresponds to the manual use of the 
Tabledex tables themselves. Here the tables are printed in conventional 





Fia. 11-6. Peek-a-boo cards. 


bound-book form. The use of a collection of tables is directly analogous 
to their use on magnetic tapes. For example, suppose that we have a 
collection of tables corresponding to our bibliography illustrated above 
and that we desire to find all articles on the application of nuclear theory. 
I’'rom the alphabetical word list the code numbers are found: application 
5.1, nuclear 4.3, theory 4.4. These are placed in numerical order: 4.3, 
4.4, and §.1. One then turns in the collection of tables to Table 4.3, 
which would appear as shown: 





TaBLe 4.3 
| 
VY 2.4 4:4. 6.1 6.8 6.2 
Vv 3.4 4.4 6.1 
3.3 4.4 68 6.1 6.2 
2.1 CB 74 


The first two rows of Table 4.8 contain both 4.4 and 4./ and are checked. 
Hence articles 2.4 and 3.4 by Seidle and by Tove are associated with the 
given words, The advantage of the manual use of Tabledex lies in its 
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being in bound-book form, without the necessity for the manipulation 
of cards. 

A computer can be programmed to automatically compile and print 
a Tabledex collection for manual use. The method used to compile the 
Tabledex collection is of particular interest as an application of the 
techniques of this chapter. First the characteristics of each article- 
characteristic association are ordered in themselves. For example, we 
would have the article-characteristic association: item 2.4, character- 
istics 2.1, 4.2, 4.3, 4.4, 6.1, 6.3, and 6.2 (see Table 11-1). Second, from 
each such article-characteristic association, a list of all possible rows of 
the tables can be generated. For example, from the associations just 
illustrated the following seven rows can be generated: 


2.4 2.1 4.2 4.8 4.4 5.1 6.3 6.2 
2.4 4.2 4.8 4.4 5.1 5.3 6.2 
2.4 4.8 4.4 6.1 5.38 6.2 

2.4 4.4.6.1 6.3 6.2 

2.4 6.1 5.8 6.2 

2.4 6.3 6.2 

2.4 6.2 


Third, the list of all such rows generated from every article-characteristi¢ 
association is then sorted by ordering on the first (leftmost) characteristie 
code number. After the ordering, all rows with the same first character 
istic appear adjacent. For example, the following rows appear adjacent ; 


2.4 5.8 6.2 
3.4 
3.3 
2.1 


When the number 4.3 has been removed, the adjacent rows are just Tabl@ 
4.3. Thus from this ordered list of rows the tables can be printed oul 
by the computer. 


11-4. Searching with Relaxed Conditions 


The Problem of Relaxed Conditions. It frequently happens that ni 
item is associated with all the characteristics given. In such a case it I 
usually advisable to search with relaxed conditions. 

For example, suppose that there are given 10 equally important chit» 
acteristics and that no item is associated with all of them. Then the 
natural question to ask is: Are any items associated with all but one 
of the characteristics, i.e., with any 9 of the characteristics? If not, then 
are there any items associated with all except 2 of the characterisation, 
i.e., with any 8 of the 10, and so forth? We shall describe a procedure 
accomplishing this important kind of search which does not involve t 
all possible combinations of 9 of the 10 characteristics, then all pos 
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combinations of 8 of the 10, ete. This procedure can be extended to include 
a second type of relaxed-condition problem which may arise, as follows: 
Suppose that 3 of the 10 given characteristics are most important and 
that it is desired to find all items associated with these 3 characteristics 
and also any 6, or any 5, etc., of the remaining 7 characteristics. The 
procedure can be easily generalized to include almost all relaxed-condition 
situations that occur in searching. 

The idea of a simple sort can be used to compare searching techniques. 
A simple sort is a separation of a collection of items into two parts by 
means of a single characteristic C, that is, into a part whose items are 
each associated with C and a part whose items are not. A simple sort is 
analogous to the manual separation of a deck of marginal-punch cards 
into two parts by means of a single needle. 

To see the advantage of the searching technique to be described over 
the straightforward method of trying all possible combinations of leaving 
out one of the characteristics at a time, then two at a time, then three at a 
time, .. . , then K = n — r at a time, note that the total number of 
simple sorts required for the straightforward method is 


n!} 
_ — ! 
iM (n—k-— 1)! 


However, the number of simple sorts required by the method to be pre- 
sented is simply 
(K + 1)(m — K) 
2 





Searching for Items Associated with Any r Out of n Characteristics. 
The method of procedure is illustrated by Fig. 11-7. Suppose that there 
are four given characteristics, represented by C1, Cs, C3, and Cy. We 
would first sort the collection of items for those associated with C;. Next 
we would sort those which have C, for those which also have Cs. Then 
those having both C; and C2 would be sorted for those also having C3, 
and so forth, until we find all items that have C1, C2, C3, and Cy. In 
ig. 11-7 we indicate the collection of items that are associated with at 
least Cy by C1, those associated with at least C1 and Co by C1- C2, ete. 
The collection of items that do not have C; is indicated by C,, etc. Then 
C,:C2+@s would represent the collection of items that at least have 
C, and C, and not Cs. Thus the results of each successive sort of the first 
pass through the collection of items are indicated by the first row of 
Vig. 11-7, Here, in the first sort, the collection of items is divided into 
two parts, the part C; and the part @;. Then the collection with Cy 
is sorted for Cy, resulting in a part Cy + Cy and a part Cy + Cy, and so forth. 
After the first pass there will result five collections Cy, Cy Cs, Cy: Co+ Os, 
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Cy: Cy-C3+ Ca, and Cy: Cy C3 + Cs; all but the last of these will be used 
if another pass is necessary. 

If the collection C;-C2-C3-C4 is empty, ie., if no items are asso- 
ciated with all of Ci-C.-+C3-Cs, we proceed to the second pass (the 
second row of Fig. 11-7). This time we start by first sorting C; for C., 
obtaining C.-C. and C,: C2. We put the collections C,- Cy and C,- C2 
together; these are sorted for C3, and two collections are obtained, one 
with Cy * Ce 2 C3 and Cy * Cs * ope the other with Cy . Cy . C3 and Cy . C; 2. C3. 
This latter collection is put with C,- C.-C and these sorted for C4 to 
obtain two collections, one with Cy: C2: C3- Cs, C1-C2° C3° Ci and 
Cy : Cy C3 . Ci, the other with C, : C2 : C3 . C4, Cy : Co , Cs; : Cy and 
C,:C2-C3:Cs This latter collection and collection C1: C.: C3: Cs 
form the collection of all items having all except one of the given charac- 
teristics. If this collection is empty, then we proceed to a third pass, as 
shown in the third row of Fig. 11-7, to find the collection of items with all 
except two of the characteristics, and so forth. 

Searching with Permanent Conditions. Suppose that there were seven 
characteristics given, Ci, Co, . . . , Cz, and that no item of the collection 
under consideration were associated with all these seven characteristics. 
The next step would be to relax the conditions, but suppose that we must 
have items associated with Cs, C's, and C7, and with as many of the others - 
as possible. The characteristic combination Cs5-C.s-Cz is called a 
permanent condition. The searching technique would be first to sort the 
collection to obtain all items associated with C;:Cs:C7. Then we would 
sort these on Cy, C2, C3, and C4, as described above, to obtain all items 
each associated with Cs - C's - Cz and three, or two, or one of Cy, Co, Cz, Cu 

As another illustration of a permanent condition, suppose that we 
desired to obtain all items associated with one or more of C3, Ce, and Cy, 
and with as many of C,, C2, C3, and Cy as possible. In such a case wé 
would first sort for C's, sort the remaining items for Cg, and then sort those 
still remaining for C;. These three sets would be put together to form 
the collection of items having one or more of C5, Cs, and C;. We would 
next sort this collection on Ci, C2, C3, and C4, as described above, to 
obtain all items each of which is associated with C;, Cs, and/or C;, and 
three, or two, or one of Ci, Co, C3, and Cy. In either of these examples, 
if no items were to have the permanent conditions, no further sorting 
could be accomplished. 

Searching with Preferential Conditions. Suppose that there were nin 
characteristics given, C1, C2, . . . , Cs, and that no item of, the collection 
were associated with all nine given characteristics. However, suppond 
that a preference were associated with the characteristics so that, say, 
C1, Co, and C3 have the most preference, (4, Cs, and Cy have the n 
preference, and C;, Cs, and Cy have the least preference. The sear 
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Fie. 11-7. Analysis of method of searchin 
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ing procedure would first try for all of Ci, C2, and Cs, or at least two 
of C1, C2, and Cs, or at least one, etc., as described above. Having chosen 
the items associated with the greatest number of the most preferred 
characteristics, these would be considered a permanent condition and 
would be sorted for all of C4, Cs, and Cz, or at least two of Cu, Cs, and Cs 
or at least one, etc. From the items having the most preferred charac- 
teristics we would now have chosen the items with the most characteristics 
of the next preference. Considering these as representing a permanent 
condition, we would try next for the items with the most characteristics 
of the least preference, that is, Cz, Cs, and C5. 

Note that the process of this section would be the natural one to follow 
in the problems of the previous paragraph when no items satisfy the 
desired permanent condition. The initial permanent condition would | 
then be stated in terms of preferential conditions, and the process of this 
section would be used to obtain an approximation to the initial permanent 
condition. 

Relaxing Sets of Conditions. Suppose that it is desired to search for all 
items associated with at least one of Ci, C2, and C3, and with at least one 
of C4, Cs, and Cy, and with at least one of Cz, Cs, and Cs, and with at 
least one of Cio, Cui, Ciz. Suppose that no such item exists; then the 
problem is to find all items that have at least one characteristic from at 
least three of the four sets of characteristics, or from at least two of the 
four sets, and so forth. Here we wish to relax sets of conditions. In 
order to facilitate the discussion, let us represent condition C1, C2, and/or 
Cs by C!, condition C4, Cs, and/or Cs by C?, condition C7, C's, and/or Cy 
by C3, and condition Cio, Cu, and/or Cy, by C*. Then the searching 
technique becomes‘as described and illustrated above, with superscripts 
replacing subscripts, and with the single sorts replaced by the combina« 
tions of sorts necessary to determine the C*. 


11-5. Sorting and Ordering 


As discussed in Sec. 11-1, sorting can always be accomplished by order« 
ing; in this section we shall turn our attention to ordering. Sorting and 
ordering are essentially the same process, except that ordering carries 
the process all the way, while sorting carries the process only part of the 
way. In our discussion of ordering we shall refer to the ordering of & 
given list of numbers in order to be concrete, but it is clear that the 
methods apply for any linear ordering rule. 

Minimum- or Maximum-in-pass. The first of the five methods of 
ordering to be presented is called the minimum-in-pass method. The 
description will be given in terms of the use of magnetic tape, but t 
principles involved are perfectly general. In this method, on the fi 
pass over the tape the minimum number is recorded as follows: ‘The first 
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number is recorded in a temporary cell and is compared successively with 
the other numbers until a smaller number appears. The first number 
replaces this smaller number on the tape as this number is placed in the 
temporary cell. The successive comparisons are continued until another 
smaller number appears; again the two numbers are interchanged, and so 
forth. At the end of the first pass it is clear that the smallest number 
appears in the temporary cell. This is placed as the first number on the 
tape, and the second pass is started with the second number on the tape, 
proceeding as before. At the end of this pass the second smallest number 
appears in the temporary cell and is placed as the second number on the 
tape, and so forth. The ith pass starts with the 7th item, determines 
the 7th smallest number, and puts it in the 7th position on the tape. For 
N numbers this method requires N — 1 passes through the tape; how- 
ever, if the tape can be positioned, starting each pass with the next 
number in turn effectively reduces the number of passes by half, to 
approximately N’/2. 

It is obvious that the maximum instead of minimum number could have 
been recorded on each pass (maximum-in-pass) and placed at the end 
of the remaining list; this method would again require about N/2 passes. 
If both the minimum and maximum are recorded in each pass, the latter 
being placed on the tape at the end of the pass, the former at the beginning 
of the next pass, the number of passes is again halved, to approximately 
N/4. Depending on the size of the high-speed memory, the least two, 
or least three, or least m numbers, and the greatest two, or greatest three, 
or greatest n numbers could be found on each pass (and ordered within the 
high-speed memory), further reducing the number of passes required. 

If more than one tape is available, the minimum-in-pass method can 
be accomplished in parallel. The original list to be ordered is divided 
into sublists (say n; in number). A new list is made from the minima 
which are removed from each of the n; sublists. This new list is divided 
into ne, sublists, and a third list is made from the minima which are 
removed from each of the nz sublists. As many new lists as desired can be 
made, and the first number of the ordered list will be the minimum of 
the last such list. As a number is removed from a list, it is replaced by 
the minimum from the sublist from which it was originally taken (until 
the sublist is exhausted). Table 11-2 illustrates the process applied to an 
original list of 12 numbers, divided into n; = 4 sublists. The minima 
from these are divided into n. = 2 sublists; their minima are then divided 
into ny = 1 sublist, whose minimum goes into the final ordered list, 
labeled nq. 

Transposition. The second method orders a list by transposition. 
The process is to transpose adjacent numbers on the list if the order of 
their characterisation ia inverted, It starts by comparing the first two 
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numbers and continues on overlapping pairs through the list. Pass after Taste 11-3. ORDERING BY TRaNsposiTIoNt 
pass is made until no transpositions are required on an entire pass; then 
the list is in order. For example, consider Table 11-3 (where the caret 
represents comparison, the ¢ transposition). Here three passes were 
necessary to order the numbers. The fourth pass will result in no 
further transpositions, indicating that the list isin order. A variation on 


this method is to order successive overlapping sets of three or more 






















































































































(say r) numbers; a single tape may still be used in this process, and the 
number of passes required is approximately divided by r — 1. Table 
11-4 illustrates the procedure for overlapping sets of three numbers. 
TaBLe 11-2. Mrnimum-rn-pass OrpDERING Usinc MUutTIPLE SUBLISTS 
! 
Ny Ne Nz 4 Ny Ne N3 Nas Ny Ne Ng M4 M1 Ne Ng M4 Ny Ne Nz M4 
12 12 12 12 12} 6 
ah3 3} : of 8 | 8 6) 1 
6 2 3 6 , | 109) [2 
2 at 9 10} 9 1 10} 9 2 3 
10 9 ; 2 : 18) |4 
g\2 }.8) |3 18 J 5 
2 4 
4 4 { 4 4 5 
1 8 i 5 7 5 1] 
1 4 11 11f * 
47-1 7 
8 uf? 
I 
B 
75 
11 + Successive overlapping pairs are scanned. 
Initial First Second Third Fourth 
minima replacement replacement replacement replacement 
selected of minima of minima of minima _| of minima, ete, TaBLe 11-4, ORDERING BY TRANSPOSITION WITHIN OVERLAPPING SETS 












oF THREE NUMBERS 


Ordering by Radix Coefficients. The third method is probably the most 

































































12) 3 33 3]1 1 
obvious of all the methods when a clearly defined radix occurs; in the dE 9 di; ; i} 2 ; 2 
present discussion we shall consider decimal numbers (i.e., radix ten), 10 10 an 2 22 if 4 44 if 4 ; 
The method consists in sorting the numbers by the most significant figure, - ’ an ? toad i 6 6 6 f 5 5 
putting the sorts in order. The numbers within each sort are ordered 1 1 an 8 : ; if 7 : - f ; 
on the second most significant figure; then within each of these subsorts the 4 4 fi 5 5 5 | 8]8 8 8 | 8] 8 8 
numbers are ordered on the third most significant figure; and so forth, 4 , ff ih a : | A 4 . | ra 
For an example, see Table 11-5. This method has the disadvantage that 7 ; 7 ak aS : y # wh 
after the first pass the computer must work on subregions of the tape, it jie 12 12 


which can present difficulties. For sorting punch cards, for example 
this means that individual sorts must be made on successively smal 


4 
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and smaller pieces of the original deck; such handling becomes increas+ 
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TaBLe 11-5. ORDERING BY THE THIRD METHOD 





Sort by Sort by Sort by 
Original most 2d most 3d most 
list significant | significant | significant 
figure figure figure 

345 256 256 255 
344 265 255 256 
537 255 265 264 
256 264 264 265 
554 2 345 337 337 
447 344 345 344 
337 337 344 345 
265 347 347 347 
255 - 447 447 446 
466 466 446 i 447 
446 446 466 464 
464 464 464 7 466 
347 ie 537 537 = 336 
264 554 536 337 
536 536 544 544 
544 544 554 554 














ingly inconvenient. 


[Cuap. 11 
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Suppose that for the fourth method ten bins, or magnetic-tape units, 
in addition to the unit holding the original list, were not available for the 
separation by least significant decimal digit. For example, suppose that 
only 5 units were available, 1 of which contains the original list. Then 
we could convert the least significant decimal digit of each number to 
two radix-4 digits and in two passes order the list by the least significant 
decimal digit. Similarly two passes would be required for the next least 
significant decimal digit, and so forth. 


TaBLe 11-6. ORDERING BY THE FourTH Mrruop 








1 
Sort by Sort by Sort by 
Original least 2d least 3d least 
list significant | significant | significant 
figure figure figure 
345 344 536 255 
344 | 554 537 256 
537 464 337 264 
256 264 344 265 
554 544 544 337 
447 345 345 344 
337 265 446 345 
265 255 447 347 
255 256 347 446 
466 466 554 447 
446 446 255 464 
464 536 256 466 
347 537 464 536 
264 447 264 537 
536 337 265 544 
544 347 466 554 














11-6. Ordering by Merging 


Merging. The fifth method is based on the simple idea of the merging 


of two ordered lists. Yor example, consider these two ordered lists of 
numbers: 


























The fourth method, which is essentially the inverse of the third, elimis 
nates this difficulty. Here the first sort is made on the least significant 
figure. Then the entire tape is sorted on the second least significant 
figure, and so forth. For an example, see Table 11-6. However, both 
these methods offer the difficulty that simultaneously used multiple tape 
units are indicated. Of course the subsorts might be made by one of 
first two methods, whence the entire sorting procedure becomes a mixttl 
of methods. 


‘ 






Forming a single ordered list containing all the numbers is accomplished 
by successively comparing the first numbers on each list and removing 


265 255 
337 264 
446 and 464 
466 536 

544 


the smaller number to a third list, 


Thus, since 255 < 265, 255 becomes 
the first number of the new list; now 264 becomes the first number of the 
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second list, and since 264 < 265, 264 becomes the second number of the 
new list; next, since 265 < 464, 265 becomes the third number of the new 
list; and so forth. In this manner the two ordered lists are merged to 
form a single new ordered list, namely, 


255 
264 
265 
337 
446 
464 
466 
536 
544 


In more mathematical notation, if the two ordered lists are a, < a, < 

- <a, and bi < be < - ++ <by;, then the merged list, namely 
Ci < C2 < ++ + < Caryn, is formed, as indicated in the flow diagram of 
Fig. 11-8. 


C,,= min (a; ,b;) 





Fia. 11-8. Flow chart for merging two ordered lists. 


The initial step of the general ordering by merging process is to separate 
the given list into two lists in a manner that preserves any ordering that 
might have originally appeared among adjacent numbers. The first 
number of the given list is used to start the first of two generated lists, 
Then successive numbers of the given list are added to this new list so long 
as they appear in the proper order. The first number encountered that 
appears out of order is used to start the second generated list All 
successive numbers that are in the proper order are placed successively in 
this second list. When a number out of order again appears, it is pl 
in the first list, with those following numbers which are in the pro 
order, and so forth. Thus in forming the two new lists we switch out, 


‘ 
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lists whenever a number appears out of order, so that any order within a 
string of adjacent numbers in the given list will be retained in one of the 
two lists. For the example below, strings of numbers in the original list 
that appear in the proper order are included in braces; these are alter- 
nately placed in the two generated lists. 


Original list New list New list 
{345 {345 fee 
{ 344 { 256 537 
537 554 {447 
Vee {337 {265 
554 ae ee 
{447 466 464 
{337 {347 264 
{265 {ss 
{ 255 544 
466 
iA 
464 
{347 
264 
fg 
544 


When this initial step is finished, the ordering. proceeds as follows: 
Kach of the two lists contains sublists that are ordered, and each sublist of 
one can be paired with a sublist of the other, except perhaps the last sub- 
list of one of the lists (see the horizontal lines below). The pairs of 
ordered sublists are then merged. The merged results are again placed 
alternately} in two new lists as shown below: 


New list 








First merge results New list 
345 mete 344 344 256 
537 345 447 
256 447 537 554 
554 merBe 265 255 
337 265 337 264 
merge 446 446 466 
464 464 536 
255 264 = 
466 merge 536 347 
544 
347 


} The alternation of the merged sublists in the new lists ensures the maximum 
number of pairings of sublists for the next merge; if the total number of sublists is 
known, the same result would be obtained by putting the first half on one tape, the 
second half on the other, 
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The process is repeated, forming each time two lists of increasingly 
longer ordered sublists, until all the numbers appear in one ordered list: 


Second merge Third merge 


results results Ordered list 

256 255 255 347 255 
344 264 256 256 
345 265 264 264 
447 337 265 265 
537 446 337 337 
554 464 344 344 
466 345 345 

536 446 347 

544 447 446 

347 464 447 
466 464 

536 466 

587 536 

544 537 

554 544 


554 


Often four magnetic tape units are used in this ordering process, 
Two of the tape units hold the old lists, and the other two record the 






Continue to 
merge sublists 


Check for end 
of a sublist 
Are both a;>a;,, 
and b> bi41? 


Use 
other 
tape unit 








Start to merge 
next pair of 
ordered sublists 










Place next number of 
unfinished sublist 
on output tape 








Fic. 11-9. Flow chart for ordering by merging. 


results of the merging. Then, when both old lists have been exhausted, 
the newly formed lists become the old lists and the pairs of tape whl 


‘ 
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exchange functions. In merging two sublists, the end of a sublist is 
sensed when a; > a:41; when both sublists are exhausted, i.e., when both 
a; > a;4: and b; > bj)41, then the merge starts on the next pair of sublists, 
the results being put on the other receiving tape unit. Figure 11-9 gives 
a flow chart for this process. 

Variations on Merging. The technique of ordering by merging two 
ordered lists is easily extended to ordering by merging more than two 
ordered lists. The initial separation of a list to be ordered can involve 
more than two new lists, and then the merging of ordered sublists would 
be accomplished successively with one sublist from each. If the initial 
step formed m new lists and the merging of sublists from these formed 
n new lists, then m + n magnetic tapes would be involved. Below we 
illustrate the case for three initial new lists (i.e.,m = n = 3): 





Ordered 
list, second 
Initial step First merge merge 
Original list New list New list New list results results 
{345 es ee fe 256 255 264 255 
bi 447 537 554 344 265 536 256 
537 Vine 337 i 345 337 544 264 
fee 466 fe 347 447 347 265 
554 464 537 446 337 
{447 264 554 464 344 
{337 fe 466 345 
{265 544 347 
{ 255 446 
466 447 
best 464 
464 , 466 
{347 536 
264 537 
536 544 
544 554 


Another variation takes further advantage of any intrinsic ordering 
in the original list. Here a tape is not closed off as a smaller item occurs, 
but rather is held open to receive any larger item later in the list. Each 
new item is compared with the final items on each tape, in order of largest 
to smallest, and is put on that tape whose last item is as large as possible 
without exceeding the new item. If the new item is smaller than all the 
final items, it is placed on the tape with the largest item to start a new 
sublist, Using this technique to sort the above list of numbers, merging 
from two lists, gives the following (with the sublists indicated) : 
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First merge Second merge 
Initial step results results 
{ 345 ee 344 ( 256 256 255 
587 554 345 337 337 264 
256 {337 537 J 447 344 265 
Vue 255 554 464 345 347 
265 {i 255 ee 447 446 
vee 464 265 { 264 464 536 
347 {264 446 347 466 544 
536 536 537 
{ 544 554 


The third merge, instead of the fourth required above for two-tape 
merging, will give the ordered list. Note how changing tapes for the 
italicized numbers allows the lengthening of the sublists. 

Comparison of Ordering Methods. The number of passes, i.e., the 
number of times one must look through all the numbers of the original list, 
is very large for the first two methods as compared with the merge 
method. The straight minimum-in-pass method requires N /2 (or N/4 
or perhaps less depending on the equipment) passes for N numbers. The 
merge sort using 27’ magnetic tape units requires no more than K + 1 
passes, where K is the smallest integer such that TX > N. The merge 
sort has the additional feature that full advantage is taken of any ordered 
sublists that may appear in the original list, and if the numbers are ran- 
domly distributed, there ought to be about N/2 ordered sublists. In 
such a case 1 + K’ passes are required, where K’ is the smallest integer 
such that 7%’ > N/2. For the transposition method there can be as 
many as n(n + 1)/2 passes, but here again any ordered sublists that may 
appear in the original list can reduce this number of passes. The varia- 
tion of the transposition method is of course more efficient. Ordering 
by the radix-coefficient methods can compete with the merge sort when all 
numbers from 1 to N are in the list, with practically no ordered sublists, 
But if the numbers have more than K digits, where K is the smallest 
integer such that R¥ > N, where R is the radix being used, then the 
merge sort would require fewer passes. Since the radix-coefficient 
methods do not take advantage of any ordering that might already exist 
in the list, the merge sort can be still more advantageous. However, if 
only a single tape is available, only the minimum-in-pass or transposition 
can be done. 

Of course the number of passes required for ordering a list is not 
necessarily the only criterion applied in deciding on a method for ordering, 
The nature of the equipment available, e.g., how many magnetic-tape 
units, or bins, are available and whether these can or cannot be used con« 
currently, how complex operations can be performed, etc., often limits the 


types of methods that can be used. The number of items or numbers 
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to be sorted, the number of times such a process must be repeated, 
etc., all enter into the choice of the ordering system. In general a 
combination of ordering techniques can be used to take advantage of par- 
ticular situations that may occur at the different levels of the ordering 
process. However, if we were asked to choose which system is of greatest 
importance, we would say the merge ordering method and its variations 
have the most advantages. 


11-7. Codifying : Error Correction and Superimposition 


Definitions. By codifyingt is meant the process of relating to an item 
or characteristic a set of symbols, called a codification. As we have seen 
(ef. Sec. 11-1), codifying is intimately associated with searching, sorting, 
and ordering. Codifying is used for three purposes: first just for con- 
venience in handling the information in a shorter form rather than in raw 
form; second for making the information compatible with a particular 
mechanical or electronic processing device being used; and third to facili- 
tate the particular method of searching, sorting, or ordering being used. 

In general, codifying involves two processes, called encoding and decod- 
ing. Encoding is the proeess of generating the codification related to a 
given item or characteristic; decoding, conversely, is determining the 
items or characteristics related to a given codification. The processes 
of encoding and decoding are accomplished by means of either a code book 
or a relation scheme. For example, a telephone book is an encoding 
code book that gives the codification (i.e., the telephone number) related 
to an item (i.e., a person); the decoding is accomplished after dialing, by 
means of a scheme built into the telephone company’s electrical dialing 
equipment. Another example is the biological codification of the genes, 
where the encoding is the evolutionary process and the decoding is the 
ontogenetic process that results in, say, an individual animal. 

The codification of items or characteristics can be either substantive or 
nonsubstantive in character. A substantive codification is one in which 
the symbols have meaning in themselves. A substantive codification 
of names, for example, might comprise simply the first four consonants 
of the last names with the initials. A word in a dictionary is a sub- 
stantive codification of its meaning. The genes, of course, represent 
an outstanding example of substantive codification. 

For a nonsubstantive codification consider the example of the codifica- 
tion of research reports by accession numbers, or the codifying of an 
item in a bibliography by the page and line number on which it is found. 
Nonsubstantive codifications are frequently merely addresses that tell 
the locations of the related items or characteristics, An exception to 
this is the population-based (nonsubstantive) codification considered in 

| See footnote on p, 408, 
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Sec. 11-2, in which the symbol for a characteristic is the number of items 
associated with that characteristic. 

The relationships between encoding, decoding, and information proc- 
essing are summarized in Fig. 11-10. For example, consider our illus- 
trative bibliography discussed above. Here the encoding was the 
relating of population-based numbers to the given characteristics (i.e., 
the given retrieval words) by means of a code book. The processing was 
searching, where, for instance, the Tabledex method might be used, 
which resulted in a collection of numbers related to the associated 
items (i.e., the page and line numbers of desired article references). 
Finally the decoding process was accomplished by locating the items 
at their address (i.e., the entries are retrieved from their page and line 
numbers). The information processing here consisted in transforming a 
set of characteristic codifications into a set of item codifications. 













Information processing 
(searching, sorting, 
ordering, etc.) 














Decoding 
process 


Encoding 
process 










codifications 


codifications 





Fria. 11-10. Relationships between encoding, decoding, and information processing. 


Efficiency of Codifications. To determine how many symbol posi- 
tions are required in an unambiguous codification using K different 
symbols, we consider each symbol as a number in the base (radix) K and 
use enough symbol positions to count the number of items in the base K, 
That is, with N items (or characteristics) and K symbols, the number of 
symbol positions in a codification must be no smaller than the smallest 
integer n such that Kn > N (or n > logx N) if the codification relating to 
an item is to be unique in the list. For example, for 600 items and a 
binary codification (that is, K = 2), we havet 
2.7782 

3 


0 = 9.261 





n = loge 600 = 


orn = 10. If the number of positions is greater than n, that is, if more 
positions are being used than are necessary, the codification is called 
redundant. If fewer than the required n positions are used, the codifica- 
tion is called irredundant and the code for an item is not unique in the 
list. A codification that is neither redundant nor irredundant is called 
efficient. 

For example, an English word can be considered as a codification of 
an idea using the 26 alphabetical symbols. Suppose that we have & 


+ Recall that logx N =logio N/logio K. For K = 2, logs N = logio N/logio % 
where logio 2 = 0.3. p 


‘ 
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dictionary all the words of which are no longer than 10 letters. We can 
make all the words the same length by introducing a 27th letter, say a, 
and filling up with a’s all positions to the left of a word to make 10 
positions altogether, if necessary. It is then possible to have more 
than 145 X 10! different words in our dictionary. However, a desk 
dictionary has only approximately 100,000, or 10°, words, and so a dic- 
tionary is redundant. Then again, that’s to be expected. 

In the next paragraphs we shall consider a use for purposely redundant 
codification; that discussion will be followed: by consideration of a use 
for purposely irredundant codifications. 

Self-correcting Redundant Codification. Frequently situations arise 
where errors may be introduced into a codification during storage or 
transmission. If errors cannot be tolerated, a redundant code can be 
devised with the extra positions of the redundancy used to correct such 


















Redundant 

























Original Redundant Information- Cc t 

| Redundant codificati Orre 
information codification Storage or ton | carrying information- 

carrying generator | codification transmission] that may codification carrying 
codification have errors | "egenerator codification 





Source of 
possible errors 


Fig. 11-11. The self-correcting redundant-codification process. 


errors as may occur. However, the self-correcting scheme that makes 
use of redundant positions must also be able to correct errors that may 
occur in these extra redundant positions as well. The process can be 
outlined thus: First from the given information-carrying codification 
there is generated the redundant codification. This redundant codifica- 
tion is stored or transmitted, etc., during which time some errors can 
enter into the codification. Finally, when the information is to be used, 
there is generated from the redundant codification (which may contain 
some errors) the correct information-carrying codification in spite of any 
errors in the redundant codification (see Fig. 11-11). In what follows 
we shall confine ourselves to binary codification. 

Let us first illustrate a self-correcting technique that will correct a 
single error. The first step is the formation of the redundant codifica- 
tion. Suppose that the information-carrying part of the codification 
has 4 bits, Jo, /1, J2, Zs—say, for example, 1101. We form an array as 
shown in Fig. 11-12 (with the top row all units and the columns otherwise 
having all possible combinations of zeros and units). The information- 
carrying codification is set down beside the array as a column. The 
desired redundant codification is formed bit by bit by comparing this 
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column successively with each column of the array, counting the number 
of units in common (i.e., logically multiplying the columns and counting 
the number of units in common). If the number of units is even, a zero 


0123 4567 

Io =1 1111 1111 

Information-carrying } J; = 1 0101 0101 
codification In =0 0011 0011 
Ig =1 0000 1111 


Redundant codification 1010 0101 


Fig. 11-12. Work for forming redundant codification for single-bit self-correction. 


is placed under that column of the array; if odd, a unit. For example, 
for the sixth bit, under column 6 we have 


1 1 1 
1\ fo\_ fo 
0 1] \o 
1 1 1 

“re = 0 

for the seventh bit 

1 1 1 
Seco eee 
0 1} \o 
1 1 i 

rp =l 


and so forth. 
Suppose that this redundant codification 1010 0101 is stored, and dure 
ing the storage the third bit r3 becomes changed. Then the incorrect code 
is 1011 0101. Of course we assume that we know nothing about the 
error in the codification except that there may be at most a single error, 
The problem remains to determine the correct bits Jo, I1, I2, and 7, of 
the original information-carrying codification. The correct value of the 
bits Iy, I2, Is, and Jy will each be derived, in this order, in spite of 
the error in the code. First note that, if J; = 0, then because of the 
method of formation we would clearly have ro = 71, r2 = 13, 74 = Py 
and rs = 17; that is, adjacent pairs of bits would be the same. If J; = 1, 
then none of these adjacent pairs would be the same. A single error cat 
affect only a single pair of positions; hence, to find J;, we simply check 
to see whether we have more “‘same’”’ pairs, or more ‘different’ pairs, 
More same pairs means J; = 0; more different pairs means J; = |, 
Similarly J, is determined from pairs (ro,r2), (71,73), (rare), aNd (Pp,Py) 
and J; from pairs (ro,r4), (1,75), (72,76), and (rs,r7z) (see Fig. 11-13 for t 
work). To find Jo, consider an array composed of the rows of 
original array corresponding to the J, Is, or IZ; which are found to 
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units and the redundant-code row. Under each column at this array 
place a zero if the number of units in the column is even, a unit if odd. 
If the number so formed is mostly zeros, Ty) = 0 ; if mostly units, J) = 1. 
Incidentally this new number clearly shows the position of the single 
error (see Fig. 11-14). 


o 1 2 8 
Incorrect redundant codification: 1 0 11 0 


3 pairs different, 
1 pair same; I, =1 


1 pair different, 


appropriate pairs 3 pairs same; Ip-0 


Lines connecting 
of positions 


3 pairs different, 
1 pair same; I3=1 


Fia. 11-13. Work for finding [,, Z2, and [s. 


As another illustration, consider a 5-bit information-carrying codifica- 
tion and an extension of the above scheme to correct as many as 3 errone- 
ous bits in the redundant codification. Let the original 5 bits be Ig = 0, 
I, =0, f= 1, 7; = 1,and I, =0. Figure 11-15a represents the work 
required to generate the redundant codification which is to be transmitted 
or stored; the work proceeds just as in the example above. Suppose 


0123 4567 


I, =1 0101 O10! 
I; =1 0000 1111 
Incorrect redundant codification 1011 0101 


New number 1110 1111 —more units; .. Jy = 1 


Fig. 11-14. Work for finding 7). (Note that error occurred in position 3.) 


that erroneous codification shown in Fig. 11-15 results from the storage ; 
then the work required to determine the correct 5 information-carrying 
codification bits appears in Fig. 11-150. 

Observe that in the first example we used 8 bits to codify 4 informa- 
tion-carrying bits, or a redundancy of 4 bits; in the second example we 
used 16 bits to codify 5 bits, or a redundancy of 11 bits. In the first, 
case we are trying to recognize 9 different possibilities, ie., no error, or a 
single error in one of eight positions. This clearly takes 4 bits, the 
redundancy used. For the second case we are trying to recognize 697 
different possibilities, i.e., no error, a single error in one of 16 positions, 


a double error that can oceur in ( “* = 120 different ways, or a triple 
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error that can occur in = 560 different ways.t To distinguish 


3 
among 697 possibilities, we clearly need 10 bits; so the 11-bit redundancy 
of the second example is one more than the minimum required. How- 
ever, note that the Oth column of the array could have been omitted, 
and the scheme would work just as well. This would give us our mini- 
mum 10-bit redundancy. 








012 83 45 67 8&8 9 1011 12 13 1415 

I-00 T1L111111 212121a434a3i3i2 

(a) : h=-0 0101 0101010101021 
Generating the Information- : 

redundant a Iz=l 0011 00110011 0011 

codification | SoMfcation 1 7.24 90001111 00001111 

-=0 0000 0000111211111 

Redundant codification: 0011 3110000i1i31id31100 

Erroneous codification: 0010 111000110100 


3 pairs different 
5 pairs same 
. I,=0 


5 pairs different 
(b) 3 pairs same 


Regenerating “ Tg=1 
5 pairs different 
3 pairs same 

. I = 1 


——_—_—> 


—_—_> 


the original 
information 


——> 





3 pairs different 
5 pairs same 
oe I, =0 


——> 





0011 001100110011 

0000 11110000 111124 

001043111000110100 
New number: 7-0 0001 0010 0000 1000 
Note that errors occurred in positions 3, 6, and 12, 


Fig. 11-15. Scheme illustrating three-error correcting process. 


Superimposed Irredundant Codification. As we have seen, a binary 
codification that will distinguish uniquely between N characteristics 
requires at least log, N bit positions. Thus, if 4,500 characteristics are 
in the list, 13 bits are required. It frequently happens that, although 
N characteristics could occur, fewer than N of these actually appear itv 
a particular situation. However, it is in general not known beforeha 


+ Recall that the number of rap of taking X objects Y at a time is given by 


« . ‘ x 
binomial coefficient ai = Vix —-Y) 
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how many will appear. In such cases, particularly when a substantive 
codification for the characteristics is used, at least log, N bit positions are 
used for the codification anyway, to take account of all eventualities. 
Thus, even though not all 4,500 characteristics will occur, 13 bits might 
still be reserved for the codification. 

Furthermore, suppose that we are dealing with items each of which is 
associated with X of these characteristics; then, according to our above 
discussion, at least X log, N bit positions must be used to identify an 
item. For example, if X = 3 and N = 4,500, then 3 X 13 = 39 bit 
positions would be necessary. Thus, in principle, V* items can be dis- 
tinguished this way; but as we observed above, we are here considering 
situations where they do not all actually appear. Evidently we are 
being wasteful of bit positions because fewer than N* items require 
fewer than X logs N bit positions for a codification. The problem there- 
fore arises: How can weuse fewer than X log2N bits for our codification, say 
H < X logs N, and yet maintain our ability to distinguish among all N 
characteristics? The answer is found in the concept of superimposed 
codification. 

By the superimposed codification corresponding to X characteristics 
we mean the result of forming the logical sum of the individual codifi- 
cations for these X characteristics. That is, if the units represent, for 
example, notches on the edge of a card, two notches in the same position 
are equivalent to a single notch. Thus, if three characteristics have the 
following codes: 














Position 
Characteristic 
12 3 4 5 6 7 8 9 1011 12 13 14 15 16 17 18 19 20 
1Stesacata 001000i10i1010000000 0 0 
2d.. 00010010:1000010000 0 0 
BY: eee ee 000000 0011000010010 





then the corresponding superimposed codification is 
0 0 1 1 00 1 01 1 100 10100 1 «0 


We shall illustrate by means of an example how superimposed codification 
can be made to satisfy the above criterion of using fewer than X loge N 
bits and yet maintain the ability to distinguish among all N character- 
istics, Suppose as above that N = 4,500. Let us codify the char- 
acteristics as follows: Use 20 positions, and let each codification con- 
tain precisely 4 units, This codification will allow us to code each 
characteristic uniquely, because 20 positions can be taken four at a time in 
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suppose that each item is associated with three characteristics; that is, 
X = 3. Then let the codification of the item be the superimposed codifica- 
tions of its associated three characteristics. Note that the codification 
for an item now comprises only 20 positions, rather than the 39 thought 
necessary above, saving about half the positions. 

The advantages of this technique can most readily be visualized in 
terms of the marginal punched cards described above, where each card 
represents an item, and where the marginal positions are the bit positions 
with a punch representing a unit, no punch a zero (see Fig. 11-16). Sup- 
pose that an item is associated with the three characteristics whose 


1st characteristic 







2d characteristic 


3d characteristic 


False drop 


Fig. 11-16. Example of superimposed codification. 


codifications appear above and that the 20 marginal positions of the 
item card are punched with the superimposed codification. Then, if all 
items associated with the first characteristic are being searched for, the 
sorting needles are placed in positions 3, 7, 9, and 11, and clearly the 
proper item cards will drop. Similarly for the second or third character 
istic. However, we must evidently pay a price for this convenience and 
saving in ttem-codificatton positions. For suppose that the code for some 
other characteristic is 


3 


10 11 19 
00100 00001 10000 00010 


Our item card (see Fig. 11-16) is not associated with this characteristie 
but nonetheless will drop in the search for items associated with this 


characteristic. The reason for this is that our codifications overlapped, 
, 


allowing for more than three combinations of 4 units. In fact, in 0 
illustrative example, nine punches appear, allowing for 


9 9! } 
(2) ~ 419 —4)1 — ie 
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such possibilities, and hence 126 — 3 = 123 possible false combinations. 
However, the situation is not so bad as it seems, for remember that we 


20 


supposed that not all the ( ve = 4,845 possible characteristics actually 


appear, and hence not all the false combinations will ever be sensed. 
The subject of superimposed codification therefore resolves itself into the 
problem of determining the probability p that an item (card) not associated 
with a particular characteristic will be selected (dropped) during the search 
for that characteristic. We shall refer to p as the probability that an 
item (card) will be a false drop. The formulas given below for this 
probability p are based on the assumption that the codes for the charac- 
teristics are randomly chosen and that the associations of items with 
characteristics are also randomly chosen. 
Let H = number of positions in superimposed codification 
Y = number of units in a characteristic codification 
X = number of characteristics associated with an item 
i=Y-1 
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and so forth, whence t 
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| Thin reduced form of the summation waa developed by William Tt. Smith TH, 
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For our example above, H = 20, Y = 4, X = 3; hencet 
20 ane : 
ri —1|] — (i+ £.4+ #3 4+ Ea) 
7 20\' 
4 
Af See 12 
_ 0.1131692 X 10 0.10890324 K 10? _ 0.0377 





0.11323941 x 10!2 
From probability theory we know that, if there are M items, then the 
expected number of false drops willbe Mp. For ourexample, if M = 200, 


wo. 
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0.1401 
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P(m) 0.08 


wo 
8m 
8 9 
6 2 
o_o 
Oo 1 


2 3 4 5 6 7 8 9 10 11 12 
m 


Fic. 11-17. Probability P(m) of obtaining exactly m false drops for H = 20, Y = 4, 
X = 3, and M = 200 (here h = 4). 


we would expect 200 X 0.0377 = 7.54 false-drop cards during the search 
on a particular characteristic. To find the probability P(m) of obtaining 
m false drops, we use the formula from probability theory, 


Pom) = (MY) px — pis 


In Fig. 11-17 we have graphed some of these values for our example. 

So far we have been considering false drops when searching on oné 
characteristic only. Now we shall discuss the situation that arises 
when more than one characteristic is used in the search. Let n be the 
number of characteristics entering the search. The codifications of 
these particular characteristics may overlap. Let us suppose that thelr 


J J 
{ For numerical calculations some helpful results are (i) =1, (‘) =U 
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logical sum has precisely h units (punches). Then 


j=n-1 t=h-1 


> Pp Eni 





p= 


and so forth, whencet 
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x 
Fy} * (7) . This is the case 


Observe that, if (7) >» n, then y 


y= 
for our example of 7 = 20, Y @ 4, and X = 3, Let us suppose that we 
are searching on three characteristics with the codifications given above 


{| This reduced form of the summation was developed by William R, Smith LT, 
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(see Fig. 11-16); then h = 9. Thus we have 


20\3 =X 
(y-Sn. 
” iM  _ (0.113732 — 0.113716) X 10” 
P Oo) 0.113732 X 10” 
4 


It is interesting to compare P(m) when searching on a single characteristic 
with P(m) when searching on three characteristics that overlap to pre- 
sent the nine units of our illustration (see Exercise 11-26). Observe that 
h can range from 4 to 12; some values of p for various values of h are 


= 0.00014 





h 4 6 9 12 





Dp 0.0377 0.0061 0.00014 0.0000003 


Note that, as long as ) >n, then the value of h is all-important 


and p is effectively independent of n. However, if, for example, H = 6, 
Y = 2, X = 3, then an n of 1, 2, or 3 can no longer be neglected. 


11-8. Exercises 


11-1. By means of the information given in Table 11-1 complete the lists and tables 
partially shown in Fig. 11-3a and b. 

11-2. Using the information given in Fig. 11-1 and Table 11-1, compile Tabledex 
tables using as the characteristics the words themselves, in alphabetical order. 

11-3. Using each of the three lists completed in Exercises 11-1 and 11-2, search for 
all items associated with each of the following sets of given characteristics: evalude 
tion, concept, design; and differentiation, application, England, gas. 

11-4. Assign population-based numbers to the items of Fig. 11-1, and compile the 
lists of the first searching method from the information of Table 11-1. 

11-5. Draw a flow chart for each of the searching methods given in this chapter, 

11-6. Punch a hole in each corner of sixteen 3 X 5cards. Then write the names of 
the authors (see Fig. 11-1) on these cards, one name on each card. Mark off eight 
equally spaced positions on one 5-in. edge, nine positions on the other 5-in. edge of 
each card. Assign one word for each position, the same on each of the 16 cards, 
Next notch the edge of each card corresponding to the positions of words characteriatie 
of this author’s article (see Table 11-1). The cards can now be searched by placing # 
knitting needle under the deck of cards at the position of the desired word (see Vig, 
11-18); the notched cards will drop, and the unnotched ones can be separated by putting 
another knitting needle through a corner hole of the raised cards. By appropriately 
separating collections of these cards as in Fig. 11-7, find all items associated with at 
least six, at least five, at least four, and at least three of the following seven given 
characteristics: differentiation, application, England, evaluation, design, gas, and concept. 

11-7. Using the cards made in Exercise 11-6, find all items that are associated W 
design and England and as many of the following as possible: differentiation, appl 
tion, evaluation, gas, and concept. : 
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Fic. 11-18. Use of knitting needles to search marginal-punch cards. 


11-8. Using the cards made in Exercise 11-6, find all items that are associated with 
either evaluation or design or both and as many of the following as possible: differentia- 
tion, application, England, gas, and concept. 

11-9. How can the process of searching for all except one, except two, etc., be accom- 
plished by means of the Tabledex tables? (Hrnr: A succession of tables must be 
used: one table for the first pass, two tables for the second pass, three tables for the 
third pass, etc.) 

11-10. Draw a flow chart for a search for items associated with all the given charac- 
tereristics except one, except two, etc. Assume the data to be given as in Fig, 11-20. 
Then assume the data to be given in terms of the Tabledex tables, as in Fig. 11-2c. 

11-11. Draw a flow chart for coding each of the four ordering methods of Sec. 11-5. 

11-12. After shuffling a deck of playing cards, order them by each of the four 
methods of Sec. 11-5 without regard to suit (i.e., the four Queens, for example, will 
appear adjacent, after the Jacks and before the Kings, but not ordered by suit among 
themselves). 

11-13. Suppose that one were ordering alphabetically by the fourth method, using a 
computer with 10 tape units. What radix could be used, and how would the tape 
units be utilized? 

11-14. Draw a flow chart for coding a merge sort using four tapes, passing the 
numbers from two tapes to the other two. 

11-15. Draw a flow chart for coding a merge sort, passing the numbers from m tapes 
to n tapes, where m and n can be specified arbitrarily. 

11-16. What is the maximum number of passes required to put a deck of 52 cards in 
some order by suit as well as by denomination, merging from two bins to two bins? 
From three bins to one bin (counting the pass required to redistribute the cards into 
the three bins after each merge)? From four bins to four bins? From five bins to 
three bins? Considering these results, discuss the most efficient way to utilize a 
fixed number of tape unite in a merge sort, 

11-17, How many codifications can be made of no more than 10 positions with 26 
symbols? If K « 26, what is the moat efficient number of positions to use for codify- 
ing 100,000 worda? 
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11-18. Given the information code 10101, find the redundant codification. 

11-19. If a redundant codification was received as 0100 1000 0001 0011, what was 
the information code sent and in what positions did errors occur? Suppose that 
0100 1000 0001 0001 was received; in what positions are the errors? 

11-20. Suppose that there were 6 bits in an information code; if the scheme given 
above were used, how many errors could be detected and how many bits would there 
be in the redundant codification? 

11-21. For H = 4, Y = 2, and X = 2, find p by means of the formulas of Sec. 11-7 
forn = 1, h = 2; for n = 2,h = 2;forn = 2,h = 3; and forn = 2,h = 4. Check 


x 2 
your result by making an enumeration of the (=) = (3) = 36 equally probable 


conditions. 

11-22. Compute pforH = 6,Y = 2,X = 3whensearchingon asingle characteristic. 

11-23. Compute p for H = 6, Y = 2, X = 3 when searching on three characteristics 
(that is, n = 3) that overlap so that h = 5. 

11-24. Compare the results of Exercises 11-22 and 11-23 by graphing P(m) for 
M = 50 in both cases. 

11-25. Find the expected number of false drops for H = 100, Y = 6, X = 4 when 
h = 15. 

11-26. Graph P(m) for a search on three characteristics that overlap to present 9 
units (i.e., h = 9, whence p = 0.00014—see above). Compare with Fig. 11-17. 

11-27. Irredundant Coding. To our knowledge Mooers [see the References] first 
discussed irredundant coding. Formulas that differ from those given in this chapter 
are found in C. 8. Wise, Mathematical Analysis of Coding Systems, in R. 8. Carey and 
J. W. Perry (eds.), ‘Punched Cards,” chap. 20, Reinhold Publishing Corporation, 
New York, 1951. The formulas given by Wise are not precise. (Why?) Prove 
the formulas given in this chapter of our text by means of an exact analysis of the 
problem. 
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APPENDIX 


The purpose of this appendix is to present a sampling of instruction 
types from three-, two-, and one-address commercially available com- 
puters. The instructions described do not at all represent the repertoire 
of these computers, but rather just some sample instructions. The sam- 
ple instructions were not chosen to reflect the versatility or scope of the 
respective computers, but rather were chosen primarily to illustrate some 
of the principles expounded in the text (see Secs. 3-1, 3-2, 3-4, 3-5, and 
3-8). Also, in some cases we have simplified the description of the 
instructions for pedagogical purposes. For detailed treatments of these 
instruction systems, see the respective programming manuals referred to 
below. 

Honeywell 800:+ Example of a Three-address Instruction System. 
The Honeywell 800 computer utilizes a three-address instruction system 
(see Fig. A-1) that also includes relative, indirect, compound, and aug- 
mented addressing techniques (see Sec. 3-5). Figure A-2 summarizes 
the alternative possibilities, which for each address in the instruction are 
distinguished by three bits, called X, Y, and Z. The X bit is in position 
4, 5, or 6 of the operation code, corresponding to the a, 8, or y address, 
respectively; the Y bit is in position 1 of the a, 8, or y address itself; and 
the Z bit is in position 7 of the a, 6, or y address itself, but is not utilized 
for direct and simple-relative addressing. There can be up to 16 banks 
of core memory, each bank of 2,048 words; there are eight banks of regis- 
ters (which constitute the so-called “special memory”). Augmented 
addressing is utilized for the selection of the appropriate memory and 
register banks, as will be described. As seen in Fig. A-2, six types of 
addressing are available, four for addressing the memory and two for 
addressing the registers, as follows: 

(a) DIRECT MEMORY ADDRESSING. The 11-bit address is augmented 
on the left by the four most significant bits of the current-address register, 
to produce a complete 15-bit address in the same memory bank as the 
instruction, 


{ Bee “Honeywell 800 Programmers’ Reference Manual,’’ Minneapolis-Honey well 
DATAmatio Division, Wellesley Hills, Mana, 1060, 
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1 3 6 9 12 15 18 21 24 27 30 33 36 39 42 45 48 
Binary (octal) 
numbers 
as 
Sign 
1 4 8 12 16 20 24 28 32 36 40 44 48 
Decimal 
numbers 
TE 
Sign 


1 6 12 18 24 30 36 42 48 


net 2 ee ee ee 


1 15 


es, [| 


(a) 
1234567 12 24 36 48 
boot 
Op code ! yi! 
1 ae | 
sjc+ top code : ‘ ; , : ’ 
(0) 


Fig. A-1. Number and instruction formats for the Honeywell 800 computer. In the 
instruction format the bit in position 1 denotes which of two possible current-address 
registers (S or C) is to be used for the address of the next instruction. 
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(b) Relative 
7 
. 
(f) Register direct 


(a) Direct 


Instruction register 


Fig. A-2. The six methods of addressing the Honeywell 800 computer (sce text for 
descriptions), , 
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(b) RELATIVE MEMORY ADDRESSING. Bits 2, 3, and 4 of the address 
select one of eight registers; these bits are, however, first augmented on 
the left by the three most significant bits of the current-address register 
so as to select the proper bank of registers. Then the contents of this 
15-bit register is added to the remaining 8 bits of the address to form the 
desired relative address. Note that this final address can be in a different 
memory bank from that of the instruction (why?). 

(c) INDIRECT MEMORY ADDRESSING. Bits 8 through 12 of the address 
are augmented by the three most significant bits of the current-address 
register, to select one of 32 registers in the register bank which cor- 
responds to the location of the instruction being executed; the contents of 
this register becomes the desired address. As in relative memory address- 
ing, the final address can be in any memory bank. As an added feature, 
after the core-memory access has been completed, bits 2 through 6 of 
the address are added to the contents of the selected register as an 
increment, and the sum is replaced in the register (resulting in register 
incrementation). 

(d) COMPOUND MEMORY ADDRESSING. Bits 2 through 4 of the address 
in the instruction being executed are augmented by the three most sig- 
nificant bits of the current-address register, to select one of eight registers 
in the register bank of the instruction; we shall call the register so selected 
the relative register. Bits 5 through 15 of this relative register are added 
to bits 5 through 12 of the address in the instruction, ‘“right-justified”’ 
(i.e., bit 15 is aligned with bit 12, 14 with 11, etc.); the resultant sum is 
augmented with bits 2 through 4 of the relative register, completing a 
14-bit “relative” address in positions 2 through 15. Eight of these 
14 bits, namely, bits in positions 2 through 4 and 11 through 15, select 
another register, which we shall call the indirect register. The contents of 
the indirect register forms the address of the desired word in the core 
memory, completing the compound indexing process. Of the six remain- 
ing bits of the ‘“relative’’ address, the bits in positions 5 through 9 are 
added (right-justified) to the contents of the indirect register as an incre- 
ment (register incrementation) after the core-memory access has been 
completed. The bit in position 10 should be the original Z bit, and hence 
care must be taken that it is not changed from 1 in the formation of the 
“relative” address. 

(ec) RELATIVE REGISTER ADDRESSING. This is similar to compound 
memory addressing in that a relative register is first chosen and then 
another register is selected from bits in positions 2 through 4 and 11 
through 15 of the ‘relative’ address. However, here the selection 
of this latter register is the desired result. Care must be taken that 
the Z bit, located in position 10 of the “relative” address, this time 
remains 0, 
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(f) DIRECT REGISTER ADDRESSING. Bits 11 through 15 of the address in 
the instruction are augmented by the three most significant bits of the 
current-address register to form the address of the desired register. 
As an added feature, bits 2 through 7 are added to the contents of the 
register selected after such contents has been utilized by the instruction 
(resulting in register incrementation). 

The following are typical instructions of the Honeywell 800 (with the 
manufacturer’s symbolic operation codes shown in square brackets): 


Arithmetic instructions 
(a) + (8) > y [BA] 
(a) — (8) — 7 [BS] 
(a) X (8) > v [BM] 
(a)/(8) — y [BD] 
Transfer 
Transfer (a) into 8; take (7) as next instruction [TS]. 
Comparison 
If (a) ¥ (8), take (y) as next instruction; otherwise continue as usual 
[NN]. 
Accumulate 
Here a must be indirect or compound, utilizing register incrementation. 
The sum is formed of 8 numbers, starting with the contents of the word 
specified by a and continuing with the addresses formed through incre- 
menting; then the result is placed in y [DT]. 
Multiword transfer 
Transfer 6 words from successive positions starting at a to successive 
positions starting at y [TN]. 
Logical instructions (where -, +, and ~, but not >, are logical operations) 
(a) - (8) + (a): (8) > y [HA] 
(a) + (8) — y [SM] 
(a) + (8) > y [EX] 
Masked instructions 
In order to preserve part of the result word [usually ()] after an opera- 
tion is performed, a so-called “mask” word can be used. Most often, — 
the bits in the result word corresponding to the zero bits in the mask 
word remain unchanged. The address of the mask word is composed © 
by bits from the instruction augmented by bits from a special mask 
regester; the mask register need not be addressed since it is automatically 
referenced during the execution of mask-utilizing operations. The 
mask register can be loaded in the same manner as any other register 
(see parts e and f above). A typical masked instruction is a right« 
circular shift: shift the contents of a right (around to the beginning of 
the word) by the number of positions specified by bits 25 through 80 


} 


uit 
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(in 8) and put the result into y, but preserving those bits of y cor- 
responding to the zeros of the mask word; the mask word is found as 
the contents of the address composed by augmenting bits 31 through 
36 of the shift instruction with bits 2 through 10 of the mask register 
of the same register bank as the instruction [SPS]. 


Univac 1103A_ Scientific:;| Example of a Two-address Instruction 
System. The Univac Scientific computer has a two-address format (see 
Fig. A-3); it has only a single addressable register, the Q register, which 


3533 30 27 24 21 18 15 12 9 6 3 0 
Number 
format 


t sign 


35 30 29 15 14 


0 
Instruction 
format 
Fig. A-3. Number and instruction formats for the Univac 1103A computer. 


cannot be used, however, for relative indexing. Typical instructions 
(with the manufacturer’s symbolic operation codes shown in square 
brackets) are: 


Arithmetic instructions 

(a) + (8) > a [RA] 

(a) — (8) > a [RS] 

(a) X (8) > ace [MD] 

(«)/(8) > B [DV] 

(a) + (Q)(8) — ace [MA] 

(a) X (Q) + (8 — Q[FP] 
Logical instructions (where - and ~, but not + and —, are logical 
operations) 

(a) -(Q) > B [QT] 

(a): Q) + (8): @)— 8 [QS] 

(ace) + (a): (Q)— B [QA] 

Here Q acts as a mask for the operand which it logically multiplies. 
Repeat instruction 

Repeat the following instruction the number of times given by positions 

15 through 26. If bit 27 is a unit, then the remaining portion of a is 

increased by 1 after each execution of the following instruction; if bit 

28 is a unit, then the same is done for 6 [RP]. 


{See “Univae 1108A Programming Manual,” Remington Rand Univac Division 
of Sperry Rand Corporation, Philadelphia, 1958, 
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Comparisons and jumps 
If (a) > (acc) take 6 [TJ]; replace (a) with C + 1 in its 8 positions, 
where C is the contents of the current-address register; then take 6 [RJ]. 
Other instructions 
Normalize (a), and store the number of positions shifted in (8), with 
the normalized result found as (acc) [SF]; transmit (a) to 6 [TP]; 
transmit the a address of (a) to the address of (8) [TU]; similarly 
transmit for 6 addresses [TV]. 


IBM 7090:+ Example of a One-address Instruction System. The IBM 
7090 has a one-address instruction system (see Fig. A-4) and includes 
three index registers for relative addressing. Bits in positions 18, 19, and 


01 35 


Fixed-point 
number 


tsign 


01 8 35 


Floating-point 
ainber ° Exponent + 128 Mantissa (proper fraction) 


£ Sign (of mantissa) 


0 12 13 18 19 20 21 35 


Fic. A-4. Number and instruction formats for the IBM 7090 computer. 


20 address these three index registers, the addresses of which are respec- 
tively 001, 010, and 100. The relative address is found by subtracting 
the contents of the designated index register from a. The index registers 
can themselves be loaded and modified by special instructions, as will be 
mentioned below. Reference to two or more index registers at the same 
time, i.e., by loading positions 18, 19, and 20 with 110, 101, 011, or 111, 
results in the use of a number obtained by forming the logical sum of the 
register contents indicated for the relative addressing. Indirect address« 
ing is also possible, and is indicated by placing 11 in positions 12 and 1% 
of the instruction: here the direct address specified by the instruction 
(whether or not relative) is interpreted as the location in the core memory 
of the word whose contents (i.e., bits 21 through 35) is the final desired 
indirect address; this final desired indirect address may itself be a relative 
address if so specified in positions 18, 19, and 20 of the location where it 
was found. The use of relative addressing in conjunction with indirect 


{See “Reference Manual IBM 7090 Data Processing System,’ International 
Business Machines Corporation, Data Processing Division, White Plains, N.Y., 1000, 
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addressing, as shown in Fig. A-5, has been termed compound addressing 
(see Sec. 3-5). 

Typical instructions (with the manufacturer’s symbolic operation 
codes shown in square brackets) are the following: 


Arithmetic instructions 

(a) + (ace) — ace [ADD] 

(ace) — (a) > ace [SUB] 

(a) > ace [CLA] 

For multiplication, the MQ register, which is simply a right extension 
of the accumulator, is utilized, and the double-length product is stored 
in both MQ and the accumulator. The MQ register is loaded by (a) > 
MQ [LDQ], and its contents is stored by (MQ)— a [STQ]. The 
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Fia. A-5. Addressing methods in the IBM 7090 computer (see text for descriptions). 


multiply instruction becomes (MQ) X (a) — acc-MQ [MPY]. For 
division a double-length divisor stored in acc-MQ is divided by (a), 
with the resulting quotient appearing in MQ, and the remainder in acc. 
The divide instruction becomes (acc-MQ)/(acc) > MQ [DVH]. 
There are also floating-point arithmetic operations, both normalized 
and unnormalized. 

Word-manipulation instructions 
Shift (acc) left @ positions [ALS]; shift (acc-MQ) left a positions [LLS]; 
(acc) > a [STO]; the location of this instruction +1 is stored in bits 
21 to 35 of (a) [STL]; (acc) > MQ and (MQ) — ace [XCL]; take 
a [TRA]; if (acc) is positive, take a [TPL]; if (acc) is negative, take 
a [TMI]; if (ace) > (MQ), take a [TLQ]. 

Index-manipulation instructions 
Place the logical complement of the current address in the index register 
specified in positions 18 through 20, and take a [TSX]; add the num- 
ber in positions 8 through 17 to the specified index register, and take 
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a [TXI]; if the contents of the specified index register is greater than 
the number in positions 3 through 17, take a [TXH] (Norte: In these 
last two instructions the operation code is located in positions 0 
through 2 of the instruction); transfer the address part of (a) to the 
specified index register [LXA]; put a into the specified index register 
[AXT]; transfer the contents of the specified index register into the 
address part of (a) [SX A]. 

Logical instructions (where -, +, and ~ represent logical operations) 
(ace) + (a) > ace [ORA] 
(acc) + (a) > a [ORS] 
(acc) : (a) > ace [ANA] 
(ace) - (a) + (ace): (a) — ace [ERA] 
(ace) > ace [COM] 

Sense-indicator instructions 
These operations, which are associated with the use of a special sense- 
indicator register, denoted by SI, serve to load, store, and logically 
change the contents of the register (where -, +, and ~ represent logical 
operations): («) > SI [LDI]; (SI) > @ [STI]; (SD + (a) > SI [OSI]; 
(SI) : (a) > SI [RIS]; (SI) - @) + GD : (a) SI [IIS]; if (D - (ace) 
contains all zeros, take a [TIO]; if (SI) - (acc) contains all zeros, 
take a [TIF]. In all except the straight transfer operations, (a) can 
be considered as a mask, with only those positions of (SI) considered 
which correspond to units in (a) (why?). 


It is instructive to note that the 7090 one-address system requires 
special instructions to handle the index registers, whereas the same effect 
is obtained in the 800 three-address system by utilizing the capability 
for directly handling the index registers in the same manner as core words. 
Also, the indirect addressing in the 800 system is accomplished through 
the very high speed special memory of index registers, whereas the 7090 
system requires intermediate access to the slower core memory. 
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Page references in boldface type indicate definitions 


A, COBOL, 263 
Absolute accuracy, 425 
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Accelerating convergence of iterative proc- 
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Accumulate instruction, Honey well 800 
computer, 540 
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addressing, 92 
of digital differential analyzer, 128 
Accuracy, 421-429 
absolute, 425 
relative, 424-425 
Acoustic delay-line memory, 47 
Actual parameter, ALGOL, 229-232 
Actual-parameter list, ALGOL, 229, 234 
Actual-parameter part, ALGOL, 229 
ADD statement, COBOL, 249-250 
Addend, COBOL, 249, 250, 253 
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(See also Addition) 
Adding operator, ALGOL, 212 
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Addition instructions, types, 100-101 
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(See also Addressing) 
ADDRESS clause, COBOL, 276 
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in IBM 7090 computer, 542-548 
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indirect, 108-109 
in Honeywell 800 computer, 538, 539, 
544 
in IBM 7090 computer, 542-544 
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partial, 107 
relative, 110, 112 
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Airline reservation system, 23-25 
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bibliography, 238-239 
characters, 205-206 
COBOL and, differences, 240-241, 245, 
251-252, 254, 256 
declarations (see Declarations) 
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history, 203 
introduction to, 193-202 
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statements (see Statement) 
structures, 204 
Algorithmic language (see ALGOL) 
Alphabet, Korean, 195, 196 
ALPHABETIC class, COBOL, 264 
Alphabetical data, COBOL, standard 
positioning, 280 
ALPHANUMERIC class, COBOL, 264 
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Alphanumerical data, COBOL, standard 
positioning, 280 
ALTER statement, COBOL, 250-251 
Alternative, notation for, ALGOL, 199 
COBOL, 241, 285 
syntax definition, 306 
Alternative data-names, COBOL, 267-270 
Alternative decisions, 325 
under conditions of risk, 327-328 
sequence, 330-333 
Alternative diagnostic probabilities, 341— 
344 
Analogy, hand computation, 28, 29 
Analysis, logical, experimental results, 484— 
486 
intelligence reports, 486—487 
numerical, 383-434 
references, 433-434 
syntactical, 295 
Anatomy of language, 204 
AND, COBOL, 242, 247-248 
and, logical, 437 
Antecedence solution, 483-484 
Applications of digital computers, 1-26 
aids to medical diagnosis, 25-26 
airline reservation system, 23-25 
biomedical, 10-12 
data processing, 20-26 
machine-tool control, 12-14 
man-machine simulation system, 18-21 
motion of charged particles (Argus 
project), 8-10 
numerical solutions to equations, 8-12 
process control, 12-15 
references, 49-52 
simulations, 15-21 
Approximation, continued-fraction, 394— 
395 
errors in, 422 
polynomial (see Polynomial approxi- 
mations) 
series, 393-394 
successive, 387-393 
ARE options, COBOL, 272-273 
Arguments, definition, 26-27 
selection, 28-30, 33-34, 39 
(See also Addresses; Addressing; 
Memory) 
Argus project, 8-10 
Arithmetic, floating, 98-100 
Arithmetic expression, ALGOL, 208, 209, 
211, 213, 214, 217, 221, 228 
simple, 211-212, 213 
COBOL, 213, 247-249 
Arithmetic instructions, 100-102 
Honeywell 800 computer, 540 
IBM 7090 computer, 543 
UNIVAC 1103A computer, 541 
Arithmetic operations, number-system, 61— 
2 


significant figures, 426-429 
Arithmetic operator, ALGOL, 206, 212 
COBOL, 242 
Arithmetic units, 28 
control, 28-30, 39 


Array, ALGOL, 230 
Array, in COBOL, 265-267 
in solving partial differential equations, 
416-421 
Array declaration, ALGOL, 226-228, 230 
Array-manipulation computer, 149-150 
(See also Business logistic computer) 
Array notation, logistics computer, 143 
Arrow, ALGOL, 212 
COBOL, 248 
Article, definite, English, 301-303, 306 
ASSIGN OBJECT-PROGRAM TO, 
COBOL, 276 
ASSIGN TO, COBOL, 275 
Asssignment, COBOL, 275-276 
Assignment list, COBOL, 276 
Assignment statement, ALGOL, 217-219, 
228 
automatic translation, 311-324 
modified syntax, for efficient transla- 
tion, 320-321 
for translator, 312 
Association unit, 369 
AT END clause, COBOL, 284-285 
Augmented addressing, 108-109 
AUTHOR clause, COBOL, 277 
Automatic program, 159 
compiler routine, address assigner, 185— 
186 
advantages, 180-181 
algebraic, 186-191 
decoder, algebraic symbols, 186-189 
loop symbols, 190-191 
recursive, 190-191 
translator, 182-185 
interpretive routines, data-handling, 172= 
179 
simulational, 168-172 
(See also Automatic programming) 
Automatic programming, 159-192 
advanced concepts, 295-298 
computing speed and, 179-181 
memory allocation, 179 
references, 192 
Automatic-programming language, 159 
ALGOL, 193-239 
COBOL, 240-294 
introduction to, 193-202 
translation, 298-324 
references, 378 
Automonitor program, 161-163 
Auxiliary switches, 104-105, 121 
Axiomatic development of Boolean algebra, 
references, 493-494 
Axioms, geometric, 350 


B (blank space), COBOL, 263, 281 
Babbage’s analytic engine, 2 
Babel, 299, 300 
Basic symbols, Boolean algebra, 448n, 
Basis, Boolean algebra, 448-449 
constrained, 468-469 
for dependent elements, 474 
designation numbers with respect to, 
469-471 
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Basis, constrained, determining, 468-469 
logical, 448-449 
symptom-disease complexes, 336, 341 
positions of columns, 449 
standard, 449 
and truth table, 448 
Bayes’ formula, 342-344 
BEFORE, COBOL, 285 
BEGINNING-FILE label-record type, 
COBOL, 273 
BEGINNING-TAPE label-record type, 
COBOL, 273 
Best-fit polynomial approximation, 402— 
404 
Bibliography (see References) 
Binary addition table, 61 
Binary arithmetic operations, 61-62 
Binary coded decimal, 96-97 
Binary codification, 523 
Binary-decimal conversion, 59-60 
Binary multiplication table, 61 
Binary number system, 55, 56 
Binary numbers and bits, 35 
Binary-octal conversion, 59 
Binary point, COBOL, 281 
(See also Decimal point; Period) 
Binary-sexadecimal conversion, 59 
Binary words, 35-36 
Binomial-coefficient look-up routine, 78 
Biochemical logical problems, 484-486 
Biomedical research, 10-12 
(See also Medical diagnosis) 
Bit-handling instructions, 102-103, 136 
BITS, COBOL, 281 
Block, ALGOL, 215-217, 219 
grammar, 228 
COBOL, 243, 272 
of words, 105 
BLOCK CONTAINS clause, COBOL, 372 
Block diagram, digital computer, 26-31, 33, 
136, 138, 140, 150, 152 
heuristic program, 354 
types of heuristics, 356 
Body, procedure, ALGOL, 230-232 
Boolean algebra, 435-494 
applications in logical problems, 483-487 
as algebra, of propositions, 440 
of sets, 442-444 
constrained, 471—472 
constraints, 467-472 
designation numbers, 447-450 
equation solution, 482-483 
equations characterizing, 440-441 
expressions, 439-442 
forms of functions, 451-460 
historical background, 435—436 
logical dependence, 472-474 
matrices, 474-487 
propositions, 486-488 
references, 403-404 
representation, hypereube, 446 
lattice diagram, 446-447 
net, 442-444 
searching problem, 400-600 
simplification methods, 464-467 


Boolean algebra, transformations, 479-482 
(See also entries under Logical) 
Boolean algebraic equations, 482-483 
Boolean expression, ALGOL, 208-211, 217, 
219-220 
simple, 209-210, 211 
COBOL, 247-248, 251 
in logic, 439-442 
Boolean function, canonical (normal) forms, 
451-452 
constrained basis, 469-471 
definition, 438 
designation number, 449-450 
E, 479-481 
F, 479-481 
fs, 479 
logical dependence and independence, 
472-474 
mongrel form, 452-453 
relation to Boolean matrices, 479-481 
simplest product of sum, 451, 489-490 
simplest sum of products, 454-467 
simplification, references, 494 
(See also Boolean algebraic equations) 
Boolean lattice diagram, 446-447 
Boolean matrices, 474-487 
equality, 475 
implication, 474-475 
logical addition, 474 
logical multiplication, 474 
transpose, 474 
Boolean matrix equations, solutions to, 475- 
476 
all, 476n., 493 
unitary, 476-478 
Boolean operations, ALGOL, truth-value 
definitions, 208 
Boolean term, ALGOL, 209-211 
Boolean transformations, 479-482 
Boolean type, ALGOL, 226, 230 
Borrow, 62 
Boundary conditions, 333 | 
Bounds, index, ALGOL array declaration, 
226, 227 
matrix solution, greatest lower, 476, 477— 
478 
least upper, 475-476, 477-478 
lowest, 476, 477-478 
Brackets, square, in ALGOL, 207, 226, 227 
Brain mechanism, 367 
Branch link, list, 297 
Break-point, 161 
Bugs in program, 160 
Business logistic computer, 140-150 
basic operations, 142-147 
example, building-material purchase, 
141-142 
keeping bank accounts, 142 
spare-parts inventory, 140-141 
illustrations of coding, 147-148 
instruction format, 142-145 
Business-oriented language (see COBOL) 
Business simulation (industrial dynamics), 
1b-10 
BY, COBOL, 260, 265, 265 
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Calculus .(see Boolean algebra; Differential 
equations; Differentiation; Integration) 
Canonical forms of Boolean functions, 451— 
<* 452 
Capacitor memory, 47 
Cards, marginal-punch, 504, 532, 533 
peek-a-boo, 504—505 
punched, 105 
_ searching, 504-506 
sorting, 504-505, 507, 532, 533 
Carry, 62 
Cell, 31 
(See also Memory) 
Certainty, decision under, 325-326 
Character, ALGOL, 205-206 
automatic-programming language, 194— 
195 
COBOL, 241-242 
list, 297 
Characteristics, COBOL, description, 263- 
264 
searching, 148-149 
definition, 495 
Chart, prime implicant, 457-459 
CHECK PROTECT clause, COBOL, 282 
Chemical plant control, 14-16, 132-137 
Chinese, 195, 196 
Class, 442 
COBOL, 261 
equivalence, 498 
CLASS clause, COBOL, 264, 270 
Classes, calculus of (see Boolean algebra) 
Clause, COBOL, ADDRESS, 276 
ADVANCING, 285 
AT END, 284-295 
. AUTHOR, 277 
BLOCK CONTAINS, 272 
CHECK PROTECT, 282 
CLASS, 264, 270 
data-editing, 280-282 
DATE WRITTEN, 277 
editing, 282 
FILE CONTAINS, 271-272 
FILE CONTROL, 276, 277 
FLOAT DOLLAR SIGN, 282 
FROM, 285 
JUSTIFIED, 280-281 
MEMORY SIZE, 276-277 
OBJECT-COMPUTER, 276 
OCCURS, 265-267, 271 
PICTURE, 263-264, 270, 281 
POINT LOCATION, 281 
PROGRAM ID, 277 
RECORD CONTAINS, 272 
REDEFINES, 267-268, 271 
REMARKS, 277 
RENAMES, 268-270 
SIZE, 263, 265, 270 
USAGE, 264, 270 
VALUE, 264-265, 270-271 
VALVE OF, 273-274 
ZERO SUPPRESS, 282 
Clock (oscillator), 36 
Clock pulse, 36 
CLOSE statement, COBOL, 283, 284 


Close-up-space pseudoinstruction, 176 
COBOL, 240-294 
ALGOL and, differences, 240-241, 245, 
251-252, 254, 256 
automatic translation, 299-300 
character, 241-242 
complete example, 285-289 
data description, detailed, 263-271 
data handling, 277-289 
data organization, over-all, 257-263 
environment description, 275-277 
file description, 271-275 
history, 240-241 
introduction to, 240-246 
operational statements, 247-257, 277 
(See also Statement) 
record description, 260-263 
references, 291—294 
words, 242-244 
Code (codification), definition, 498 
Code (program), definition, 7, 41, 53 
Coder, definition, 41 
Codes, decimal, 96-97 
excess-three (decimal-coded binary), 96- 
97 
multiple, 152-153 
octal-coded binary, 63 
recursive, in ALGOL, 221 
symbolic, Honeywell 800 computer, 540- 
541 
IBM 7090 computer, 543-544 
instructions for translator, 311 
UNIVAC 1103A computer, 541-542 
Codification, binary, 523 
irredundant, 526-532, 534 
nonsubstantive, 521 
redundant, self-correcting, 523-526 
substantive, 521 
superimposed, irredundant, 526-532, 534 
Codifying, decoding, 521-522 
definition, 498 
efficiency, 522-523 
encoding, 521-522 
purposes, 521 
Coding, 53-157 
decision, 67-69 
introduction to, 53-54 
minimum access, 96 
principles, 53 
special techniques, 112-118 
input-output instructions, 115-116 
multiway branch, 112-113 
switch sensing, 113-115 
table look-up, 77-78, 116-118 
tactics, 53 
(See also Codification; Codifying; Pro« 
gramming) 
Coding manual, 63-64 
Coding sheet, 41 
Collection, empty, 508 
Column-adding routine, 78-79 
(See also Business logistics computer) 
Combined element, definition, 498 
designation number, 449-450 
(See also Boolean function) 
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Comma, ALGOL, 206, 226, 229, 230 
COBOL, 242, 249, 255, 265, 273, 275, 276, 
279, 283 
Commands, definition, 31 
(See also entries under Instruction) 
comment declaration, ALGOL, 233 
Common business-oriented language (see 
COBOL) 
Comparison, dummy, 90 
logical, 102-103 
Comparison instruction, 67, 89, 91, 94, 104, 
147 
Honey well 800 computer, 540 
UNIVAC 1103A computer, 542 
(See also Conditional statement; Decision 
instructions) 
Compiler, in COBOL, 248 
recursive, 190-191 
Compiler routines, 164, 180-191 
address assigner, 185-186 
advantages, 180-181 
algebraic, 186-191 
decoder of algebraic symbols, 186-189 
flow chart, 184 
kinds, 182 
memory required, 189 
purpose, 182 
translator, 182-185 
Complement, Boolean matrix, 474 
logical, 103 
(See also Inversion; Negation; not) 
Complex, symptom-disease, 337 
probabilities, 343 
Component, key, translator, 307, 310 
syntax definition, 306 
Composition, musical, by computer, 371- 
375 
Compound addressing (see Addressing) 
Compound if statements, ALGOL, 219 
Compound statements, ALGOL, 215-217, 
219 
Computation analogy, 28, 29 
Computational-array drum, 143, 149, 150 
COMPUTATIONAL usage, COBOL, 264 
COMPUTE statement, COBOL, 249 
Computer, analog, 7, 18 
applications, 1-26 
references, 49-52 
array-manipulation, 149-150 
block diagrams, 26-31, 33, 136, 138, 140, 
150, 152 
business logistics, 140-150 
decimal, 96-98 
digital, 7 
digital differential analyzer, 127-132 
evolution, 3 
functional description, 31-35 
general-purpose, 7 
history, 2-6 
information-retrieval, 187-140, 149-150 
large-seale data processor, 160-154 
references, 157 
operating modes, 121 
operations performed by, 26-27 
(See alae ontrion under Tnatruction) 


Computer, parallel, 38 
real-time control, 132-1387 
searching, 1387-140, 149-150 
serial, 38 
special-purpose (see Special-purpose 
computers 
speed, 28, 36, 48, 47, 179-181 
systems-control, 132-137 
terminology, 31-32 
types, 7 
Computer name, COBOL, 276 
Computer words, 31-32, 35-36, 40-41 
Computing unit, 27-28 
of digit-differential analyzer, 128-130 
Concepts, abstract, 363-367 
references, 381-382 
generation, 375 
Concurrent input-output, 105, 151, 153— 
154 
Condition, tally, ALGOL, 221-223 
Condition description, COBOL, 265 
Condition-name, COBOL, 243, 246, 247, 
265 
value, 264—265 
(See also VALUE clause) 
Condition-value entry, COBOL, 265 
Condition-value list, COBOL, 265 
Condition-variable, COBOL, 245-247, 264— 
265 
value, 265 
Conditional forms, ALGOL, 219-220 
Conditional jump, 93, 94, 104 
Conditional probabilities, 341 
Conditional statement, ALGOL, 219-221 
COBOL, 251-253 
differences, 251-252 
Configuration section, COBOL, 276-277 
Conjunction, English, 444 
logical, 437, 438 
Conjunctive normal form, 452 
Connection unit, 369 
Connections, heuristic programming, 358, 
366 
Connector, 69 
variable, 81 
in ALGOL, 227-228 
Consequence solution, 483-484 
Consequences of mathematical results, 
324 
Constant numbers, 66 
Constants, in COBOL, 270-271, 274 
Constrained basis, for dependent elements, 
473 
designation numbers, 469-471 
determination, 468-469 
Constrained Boolean algebra, 471-472 
Constraints, 467—472 
in medical knowledge, 338-339 
in searching problem, 499 
simplest sum of products under, 470— 
471 
Continued fractions, 894-395 
Control, digital, 7 
applications, 126 
ohemloal plant, lde16, 182-1387 
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Control, digital, applications, inventory, 
20-23 
machine-tool, 12-14 
process, 7, 12-15, 132-137 
references, 157 
Control generator, 29-30 
Control panel, 118-122 
buttons and switches, 120-121 
lights and signals, 120 
purpose, 118-120 
Control pulses, 39 
Control signals, 39 
Control unit, 28-30, 39 
Controlled variable, ALGOL, 225 
Convergent, continued-fraction, 394 
Conversion, number, 54-60 
from binary to octal or sexadecimal, 59 
of fractional part, from decimal, 58-59 
to decimal, 60 
of integers, from decimal, 57-58 
to decimal, 60 
Conversion table, 56 
Cookbook language, 200—202 
Core memory, 45, 47 
Cores, magnetic, 45, 48 
CORRESPONDING, MOVE, COBOL, 279 
Counterpoint, 372 
Counters in digital differential analyzers, 
128 
Creativity, 363, 371-375 
Criteria of progress, heuristic programming, 
359 
Current-address counter (see Current 
address register) 
Current-address register, 33-34, 89 
Curve fitting, 14 
(See also Polynomial approximation) 


Data, COBOL, 244-245 
positioning, 280-281 
definition, 31 
end word for, 176, 177 
file, COBOL, 271-272 
organization, bookshelf, 176 
for searching, 173 
threaded list, 177 
Data declarations, cookbcok language, 201, 
202 
Data description, COBOL, 261, 265, 268, 
277-278 
detailed, 263-271 
in MOVE statement, 279 
special-purpose, 264—270 
working storage, COBOL, 270-271 
Data division, COBOL, 244—246, 260 
Data-editing clauses, COBOL, 280-282 
Data handling, COBOL, 277-289 
Data-handling interpretive routines, 172— 
179 
Data-handling statements, COBOL, 278— 
279 
Data-item, COBOL, 242, 246 
elementary, 262—264 
positioning, 280-281 
Data-item description, COBOL, 261 


Data-name, COBOL, 242-244, 247, 249- 
251, 253, 255, 261, 264, 268, 269, 272, 
274, 279, 285 
alternative, 267-270 
partial, 279 
value, 242-256 
initial, 270 
Data-name in constants, COBOL, 274 
Data-name in data-record, COBOL, 274 
Data-name in label record, COBOL, 274 
Data processing, applications, 20-26 
techniques, 383-536 
(See also Business logistic computer; Data 
handling; Information retrieval; 
Searching) 
Data-processing language (see COBOL) 
Data-record name, COBOL, 271, 272 
Data (of program), COBOL, 277 
DATE WRITTEN clause, COBOL, 277 
Debugging, 160-164 
Decimal arithmetic, 61, 96-97 
Decimal-coded binary, 96-97 
Decimal coding system, 96-98 
Decimal conversion, 55—60 
Decimal fraction, ALGOL or COBOL, 198- 
199 
Decimal number, ALGOL or COBOL, 198- 
199 
Decimal point, ALGOL, 198 
COBOL, 198, 264, 281 
(See also Period) 
Decision instructions, 103-105, 112 
coding with, 67-69 
(See also Comparison instruction; Condi- 
tional statement) 
Decision theory, modern mathematical, 
324-333 
references, 378-379 
Decisions, management, 17 
Declaration, ALGOL, array, 226-228, 230 
comment, 233 
procedure, 230—234 
for function, 234-235 
relation to procedure statement, 281= 
232 
switch, 214, 227-228, 230 
type, 226-228 
COBOL (DEFINE statement), 256-257 
Declarations, ALGOL, 225-228, 230-234 
labeling, 216 
cookbook language, 201, 202 
need for, 225-226, 228-229 
Decoder, algebraic symbol, 186-189 
instruction, 28-30, 33, 39 
Decoding and encoding, 521-522 
Deductive inference, 295-296, 324-333 
in abstraction, 363-365 
in abstraction formulation, 367-369 
example in medical diagnosis, 334-348 
DEFINE statement, COBOL, 256-257 
Definite article, English, 301-303, 3006 
Definition of a language, 299 
Definitions, recursive, 198 
syntactical, 194, 197-199 
symbols, 199 
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Delimiter, ALGOL, 206 
COBOL, 242 
De Morgan’s rules, 440 
Dependence, logical, 473-474 
Dependent variable, ALGOL, 229, 231, 234 
COBOL, 255-256 
DEPENDING ON, COBOL, 251 
Derivation of fundamental matrix formulas, 
479-482 
Description, COBOL, condition, 265 
data, 261, 265, 268, 277-278 
detailed, 263-271 
in MOVE statement, 279 
special-purpose, 264-270 
working storage, 270-271 
data-item, 261 
environment, 246, 275-277 
file, 246, 271-275 
identification, 246 
label-record, 271-275 
record, 246, 260-261, 262-263 
Designation numbers, 447-450 
basis, 448-449 
to Boolean function, 451-467, 470-471 
of Boolean functions, 449-450 
Boolean matrices and, 479-481 
of combined elements, 449-450 
under constrained basis, 469-471 
Designational expression, ALGOL, 208, 
214-215, 218, 227 
simple, 215 
Designator, ALGOL, function, 207-208, 
209, 212, 229, 230, 234 
switch, 227-228 
Desk-calculator analogy, 28 
Destination, COBOL, 251 
Devanagari, 195, 196 
Diagnosis (see Debugging; Medical 
diagnosis) 
Diagnostic probability, 343 
alternative, 341-344 
Diagram (see Block diagram; Flow charts) 
Diagram routine, heuristic programming, 
354-355 
Difference equations, 416-421 
Difference quotients, 415-417 
Differences, heuristic programming, 
abstract-concept, 365-366 
trigonometric, 355-359 
polynomial, 395-397 
Differential additions, 127 
Differential equations, ordinary, 410-414 
partial, 414-421 
Differentiation, by definition, 407 
by polynomial approximation, 408-409 
by undetermined coefficients, 410 
Digit, ALGOL or COBOL, 198, 199, 206— 
208 
COBOL, &41 
Digital computer (see Computer) 
Digital differential analyser, 127-182 
computing unite, 128-180 
concept, 127, 120 
programming, 140-189 
purpone, 1&7 


Digital differential analyzer, references, 
156-157 
Digital switching circuits (gates), 36 
Diode memory, 47 
Direct memory addressing, in Honey well 
800 computer, 537, 538 
in IBM 7090 computer, 543 
Discussion (of program), COBOL, 277 
Disease complex, 337 
probability, 343 
Disease profile, 338 
Disjunction, 437, 438 
Disjunctive normal form, 451 
DISPLAY usage, COBOL, 264 
DIVIDE statement, COBOL, 250 
DIVIDED BY, COBOL, 242, 248 
Division, arithmetic, in digital differential 
analyzer, 131-132 
(See also Division instructions) 
COBOL, data, 244-246, 260 
environment, 246n., 277 
(See also Environment description) 
identification, 246n., 277 
(See also Identification description) 
procedure, 244, 246 
Division instructions, types, 102 
do, ALGOL, 221-225 ; 
Dollar sign, COBOL, 242, 263, 282 
“Don’t-care” conditions (see Constraints) 
Drum, magnetic, 45, 46, 143 
type-face, 44 
Drum memory, array of numbers stored, 
143 
computational-array, 143-150 
searching, 149, 150 
Dummy comparisons, 90 
Dummy instruction, 65 
Dynamic program debugging, 160-163 
automonitor, 161-163 
break-point, 161 
Dynamic programming, 330-333 
in medical diagnosis, 344-348 


E function, 479-481 
E matrix, 480-481 
Editing clauses, COBOL, 282 
Editing symbol, COBOL, 242 
EDSAC, 6 
EDVAC, 6 
Efficiency, of codifying, 522-523 
of syntactical and semantic description, 
324 
Element, for, ALGOL, 221 
set, 442 
Elementary data-item, COBOL, 262-264 
positioning, 280-281 
Elementary elements, 438 
independent and dependent, 467-468 
Elementary product, 451 
Elementary sum, 452 
ELSE, COBOL, 251 
Else fragment, COBOL, 251-252 
Blae statement, COBOL, 261-252 
Empty, ALGQOL, 229, 280 
COBOL, 261, 268-254 
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Encoding, 521-522 
End-of-data word, 176, 177 
ENDING-FILE label-record type, COBOL, 
273 
ENDING-TAPE label-record type, COBOL, 
273 
English, analogy with automatic-program- 
ming languages, 194, 197, 200, 202, 
225-226, 240, 248, 253 
logical translation, 444-446 
references, 494 
semantics, 200 
translating into Korean, 301-306 
ENIAC, 2 
Entrance, subroutine, 81 
Environment, COBOL, 246 
Environment description, COBOL, 246. 
275-277 
Environment division, COBOL, 246n., 277 
Environmental factors affecting diagnosis, 
343-344 
Enzyme reactions, analyzing, 484-486 
Equality, of Boolean matrices, 475 
logical, 102--103 
(See also Equivalence) 
EQUALS, COBOL, 249 
Equations, algebraic, 387-393 
Boolean, 482-483 
proving, 440-441 
difference, 416-421 
linear, simultaneous, 384-387 
numerical solutions as computer appli- 
cations, 8-12 
ordinary differential, 410—414 
partial differential, 414—421 
transcendental, 387-393 
Equivalence, Boolean, 439 
Equivalence relation, 498 
Error, curves for, in function evaluation, 
399, 400, 403, 404 
experimental, 422 
pitfalls from 428-429 
relative, 424-426 
sources, 421-423 
truncated, 421-422 
(See also Accuracy) 
Error detecting and correcting codes, 523— 
526 
references, 535-536 
Error propagation of polynomial differences, 
397 
Essential prime implicants, 458 
finding, 461—467 
Etiological factors affecting diagnosis, 343- 
344 
Evolution of digital computer, 3 
Excess-three code (decimal-coded binary), 
96-97 
Exclusive or, 445 
Execution of instruction, 26-27 
Exit, COBOL loop, 284-285 
subroutine, 81 
Expansion, of function (see Functions, 
evaluation) 
of numbers in powers of radix, 54 


Experimental errors, 422 
Exponent, floating-point, 99 
EXPONENTIATED BY, COBOL, 248 
Exponentiation, ALGOL, 212 
COBOL, 248 
Expression, ALGOL, 229 
arithmetic, ALGOL, 203-205, 208, 209, 
211-213, 214, 217, 221, 228 
simple, 211-212, 213 
automatic translation, 311-324 
COBOL, 213, 247-249 
Boolean, ALGOL, 208-211, 217, 219-220 
simple, 209-210, 211 
COBOL, 247-248, 251 
designational, ALGOL, 208, 214-215, 218, 
227 
simple, 215 
Ixpressions, ALGOL, 206-215 
automatic-programming language, 197 
Boolean, in logic, 489-442 
pictorial, in abstraction, 364-367 
Extract instruction, 102-103, 136 
in use, 1383-135, 167, 171 


F, COBOL, 241 
(See also Logical value) 
F function, 479-481 
F matrix, 480-481 
Factor, ALGOL, 212 
False (see Logical value) 
False drop, probability, 529-532, 534 
FD, COBOL, 271 
Field, computer-memory, COBOL, 279-281 
File, COBOL, 246, 257-260, 271 
opening and closing, 283 
storing, 257-260 
FILE CONTAINS clause, COBOL, 271-272 
FILE CONTROL clause, COBOL, 276, 277 
File data, COBOL, 271-272 
File description, COBOL, 246, 271--275 
File-handling statements, COBOL, 282—285 
File-name, COBOL, 271, 275, 283, 284 
File-name list, COBOL, 283 
File-name options, COBOL, 283 
File-name options list, COBOL, 283 
File organization, COBOL, 271-272 
FLOAT DOLLAR SIGN clause, COBOL, 
282 
Floating point, 98-100 
simulation routine, 168-172 
Flow charting, 69-70 
symbols, 69, 74 
Flow charts, accelerating convergence of 
iterative processes, 392 
algebraic-compiler, 188, 190 
COBOL conditional statements, 252 
compiler routine, 184 
computation, of radiation intensity, 83 
of VN, 75 
diagnostic-testing iteration, 340 
dynamic diagnostic-treatment cycle, $48 
evaluation of single function, 190 
forming COBOL loops, 255 
generalized loop, 74 
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Flow charts, input-output routine, 115 
instruction modification, 78 
interpretive routine, 166 

sequencing, 170 
loops within loops, 80 
merging of ordered lists, 516 
missile trajectory, 73 
ordering by merging, 518 
real-time operating program, 135 
processing COBOL records, 260, 284 
recording common numbers on two 
ordered lists, 175 
setting up subroutines, 82 
solution, of simultaneous linear equations, 
387 
of sin z = 2 — 2z2, 890 
split-tree method of table look-up, 117 
table look-up routine, 77 
translator, detailed, 308-309 
gross, 307 
use, 301 

For element, ALGOL, 221 

For list, ALGOL, 221-223 

for statement, ALGOL, 219, 221-225 
COBOL (PERFORM statement), 253- 

255 

Formal parameter, ALGOL, 230-232 

Formal-parameter list, ALGOL, 230 

Formal-parameter part, ALGOL, 230 

Format, in COBOL, 244-245, 256 

defining new, 256 
instruction (see Instruction format) 
label-record, COBOL, 274 
(See also Label-record description) 
list-processing language, 298 
one-address, simulated, 169 
output, COBOL, 280-282 
record, COBOL, 260-263 
(See also Record description) 
word, 39, 62-66, 88, 90, 93, 95-96 
Honey well 800 computer, 538 
IBM computer, 542 
UNIVAC 1103A computer, 541 

Forms, conditional, ALGOL, 219-220 
grammatical, 194, 200-202 

Formulas, well-formed, in geometry, 350 

in heuristic programming, 357 

Formulation of abstractions, 367-371 

FORTRAN (see ALGOL) 

Fraction, decimal, ALGOL or COBOL, 198- 

199 
FROM, COBOL, 249, 250, 253, 285 
FROM clause, COBOL, 285 
From list, COBOL, 253-254 
Foundations of Boolean algebra, references, 
493-404 

Four-address format, 39, 62-65 

Function designator, ALGOL, 207-208, 209, 
212, 220, 280, 234 

Function subroutine, 164-165 
reused, 180, INL 

Functional caloulus of aymbolie logie, 486 

Funetional deseription of computer, 41-45 

Functions, Boolean Gee Hooiean finetion) 
evaluation, WOU 404, 400-410 


Functions, evaluation, best-fit polynomial 
approximations, 402-404 
expansion, continued-fraction approxi- 
mation, 394-395 
series of approximations, 393-394 
flow chart, 190 
interpolation, 398—401 
Aitken’s method, 400-401 
polynomial approximations, 393, 398- 
401 
undetermined coefficients, 409-410 
given in logical problems, 483 
logical, ALGOL, 210 
medical knowledge, 337 
procedure statements, ALGOL, 234-235 
(See also Numerical analysis; Sub- 
routine) 
Fundamental formulas, Boolean trans- 
formation, 481—482 


Game theory, 328-330 
Gates, 36 
digital differential analyzer, 128-130 
General problem-solving program, 355-362 
General-purpose computer, 7 
Generalized address, 143-144 
Geometry, plane, 349-354 
GIVING, COBOL, 249-250 
go to statement, ALGOL, 214, 218-219 
with conditional statement, 220-221 
in for statement, 225 
in switch declaration, 227-228 
GO TO statement, COBOL, 250-251, 253 
Grammar, ALGOL, of blocks, 228 
statement, assignment, 218 
conditional, 221 
for, 225 
go to, 219 
procedure, 229 
Grammatical forms of language, 194, 200- 
202 
Greatest lower bound of matrix solutions, 
476-478 


Hardware, in COBOL, 275-277 
Hardware list, COBOL, 275, 276 
Hardware-name, COBOL, 275-276 
Harmonic rules, 372-373 
Harris-Elbourn method of simplification, 
460-467 
HASHED option, COBOL, 273-274 
Heading, procedure, ALGOL, 230-232 
Health-computing system, 25-26 
Heuristic programming, 348-355 
comparison with translator, 355 
references, 379-381 
technique, 351-354 
Heuristic routine, 354-355 
Heuristics, 349, 351 
semantic, 352, 354 
source in geometry, 350, 351 
syntactic, 852, 353 
ayntactionymmetry, 353 
types, 465-857 
Hloxaceolmal (ae Sexadeoimal) 
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High-speed printer, 43-45 
History, ALGOL, 203 
Boolean algebra, 435-436 
COBOL, 240-241 
computers, 2-6 
Hodgkins-Huxley equations, 11 
Honey well 800 computer,.537—541 
House, idealized concept, 366 
Hypercube representation of Boolean 
algebra, 446 
Hypotheses, good, choosing, 349 
in logical problems, 483 
Hyphen, COBOL, 242 


IBM 7090 computer, 542-544 
Identification, abstract, 363-367 
program, COBOL, 277 
Identification description, COBOL, 246 
Identification division, COBOL, 246n., 277 
Identifier, ALGOL, 207n. 
procedure, 207-208, 229, 230 
automatic-programming language, 197 
IF, COBOL, 251 
If clause, ALGOL, 211, 213-215, 219 
If fragment, COBOL, 251-252 
if statement, ALGOL, 219 
COBOL, 251-252 
Illiac, 375n. 
Implicant, definition, 454 
(See also Prime implicant) 
Implication, ALGOL, 210 
Boolean matrices, 474—475 
logical, 439 
In-out file-name list, COBOL, 283 
In-out selector, 30, 153 
(See also Interlocks) 
INCLUDE statement, COBOL, 254-255, 
256 
Inclusive or, 445 
Independence, logical, 472-473 
Independent symptoms, 344 
Independent variable, ALGOL, 208, 229-— 
231, 235 
COBOL, 255-256 
Independent-variable list, ALGOL, 208, 234 
Index, ALGOL, array declaration, 226-227 
for statement, 223, 225 
translator subroutine, 311 
Index-manipulation instructions, IBM 7090 
computer, 543-544 
Index registers, 110 
Honey well 800 computer, 537-540, 544 
IBM 7090 computer, 542-544 
in list processing, 298 
use in push-down lists, 178-179 
Indexing (see Tabledex) 
Indirect addressing (see Addressing) 
Inductive inference, 295-296, 348-355 
in abstraction, 365-367 
example of general problem solver, 355- 
362 
Industrial dynamics, 15-19 
Inequality, logical, 102 
Inference (see Deductive inference; Induc- 
tive inference) 


Information retrieval, 8 
references, 534—535 
(See also Data processing) 
Ingredients, of formal mathematical sys- 
tem, 350 
of a language, 194-197 
of medical diagnosis, 335 
INPUT, COBOL, 283 
Input-file area, COBOL, filling, 283-284 
transfer from, 278-279 
Input files, COBOL, 257-260, 283 
Input numbers, 6-7, 26 
Input-out put, concurrent, 105, 151, 153-154 
Input-output equipment, in COBOL, 275— 
277 
Input-output methods, 30, 41-44, 45 
speeds, 43 
Input-output problems, large-scale data 
processor, 150-151 
Input-output section, COBOL, 276-277 
Inscriber, 41 
Instruction decoder, 29-30, 33, 39 
Instruction execution, 26—27 
Instruction format, business logistic com- 
puter, 142-145 
four-address, 39, 62-65 
list-processing language, 298 
one-address, 93-94 
IBM 7090 computer, 542 
simulated, 169 
retrieval computer, 138 
three-address, 88 
Honeywell 800 computer, 538 
two-address, 90 
UNIVAC 1103A computer, 541 
Instruction modification, 32, 77-79, 111 
Instruction register, 33, 39 
Instruction sequences, 62-67 
Instruction types, 98-106, 112 
arithmetic, 100-102 
bit-handling, 102-103, 136 
close-up-space, 176 
comparison of words, 67, 104, 147 
decision, 67-69, 103-105 
(See also Conditional statement) 
extraction, 103-136 
using, 167, 171 
Honeywell 800 computer, 540 
IBM 7090 computer, 543-544 
input-output, 105-106 
jump, 104, 112 
list-processing language, 297—298 
logical, 102-103, 136 
make-space, 176 
one-address, simulated, 169 
read and’ write, 105-106 
shift, 103 
switch sensing, 104 
transfer, 167, 170 
translator target language, 311 
UNIVAC 1103A computer, 541-542 
Integer, ALGOL, 227 
COBOL, 263-265, 271, 272, 281, 285 
unsigned, ALGOL or COBOL, 108-109, 
215 
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integer type, ALGOL, 226, 230 
Integrating general- and special-purpose 
computers, 139-140, 151-154 
Integration, by definition, 404-405 
on logistics computer, 148 
by polynomial approximations, 406-407 
by Simpson’s rule, 406-407, 409-410 
by trapezoid rule, 406 
in ALGOL, 233 
by undetermined coefficients, 409-410 
Intelligence achievement, mathematics in, 
325 
programming in, 295-382 
Intelligence reports, logical analysis, 486- 
487 
Interlocks, 150-154 
International Algebraic Language (see 
ALGOL) 
Interpolation, 398-401 
Interpretation of word, 32, 35 
Interpretive routines, 164-179 
complications, 179-180 
data-handling, 172-179 
mathematical, 164-168 
simulational, 168—172 
Intersection of sets, 443 
INTO, COBOL, 250 
Intuition, 349 
Inventory control, 20-23 
(See also Business logistic computer) 
Inversion, logical, 450 
(See also Complement) 
Irredundant codifying, 526-532, 534 
definition, 522 
Irrelevant symptoms, 369 
IS EQUAL TO, COBOL, 242, 247-248 
IS GREATER THAN, COBOL, 242, 247-— 
248 
IS LESS THAN, COBOL, 242, 247-248 
IS options, COBOL, 272, 273 
Item, 148-149, 495 
Item-list searching, 500 
Iteration, 74 
COBOL, 254 
diagnosis-testing, 335, 339-340 
diagnosis-treatment, 344-348 
difference equation, 418—420 
in dynamic programming, 333 
in successive approximations, 387-393 
(See also Recursion codes) 
Iterative process, accelerating, 389-393 


Jacquard loom, 2 
Jump, 80n. 
negative, 104 
Jump instructions, 104, 112 
ALGOL, 214 
(See also go to statement, ALGOL) 
conditional, 08, 04, 104 
(See also Comparison instruction; 
Conditional atatement; Decision 
inatructiona) 
IBM 7000 computer, 644-544 
unconditional, 01, 04 
UNTIVAC 110A computor, 642 


Jump table, 113, 114, 133 
JUSTIFIED clause, COBOL, 280-281 


Key component, translator, 307, 310 

Key symbol, translator, 307, 310 

Korean, translating English into, 301-306 
Korean alphabet, 195, 196 


Label, ALGOL, 215-217, 219, 228 
if statement, 219-220 
in switch declaration, 227 
automatic-programming language, 197, 
214 
of block of data, 173 
COBOL, 246, 251, 253, 255 
Label list, ALGOL (switch list), 227 
COBOL, 251 
Label record, COBOL, 272-275 
Label-record description, COBOL, 271— 
275 
Label-record type, COBOL, 271-273, 274 
Label-record-type list, COBOL, 273-274 
Labeling words of lists, 173 
Language, anatomy, 204 
automatic-programming (see Automatic- 
programming language) 
cookbook, 200—202 
definition, 299 
ingredients of, 194-197 
list-processing, 296-298 
object, 299n. 
source, 299 
target, 299 
instructions for translator, 311 
(See also ALGOL; COBOL) 
Laplace’s equation, 416, 418-421 
Lattice array in solving partial differential 
equations, 416-421 
Learning, concept, 369-371 
heuristic programming, 362 
Least upper bound of matrix solution, 475— 
478 ‘ 
Left part, ALGOL, 217 
Left part add, COBOL, 249 
Left part compute, COBOL, 249 
Left part perf, COBOL, 253 
Left part sub, COBOL, 250 
Length of word, 35 
Letter, ALGOL, 206-208 
COBOL, 241 
Level number, COBOL, 261-263, 265, 268 
syntax definition, for translator, 304, 306, 
310-311 
Level 66, COBOL, 269 
Level 77, COBOL, 270 
Level 88, COBOL, 265, 271 
Library of subroutines, 84-85 
Line, syntax definition, 306 
Linear algebraic equations, simultaneous, 
884-387 
Linear Boolean equations, 482-483 
Linear ordering, 498 
Linear programming, 325-326 
LINES, COBOL, 285 
Link, let, 207 
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List, ALGOL, actual-parameter, 229, 234 
for, 221--223 
formal-parameter, 230 
independent-variable, 208 
label (switch), 227 
variable, 226, 230 

characteristic, 495 
COBOL, addressed, 249, 250 
assignment, 276 
condition-value, 265 
file-name, in-out, 283 
file-name-options, 283 
from, 253-254 
hardward, 275, 276 
label, 251 
label-record type, 273-274 
memory-segment, 276 
partial-data-name, 279 
receiving-area, 279 
receiving-field, 279 
record-name, 272 
replacement, 255 
end of, word for, 176, 177 
item, 495 
ordering, 498, 510-521 
population-based, 503 
push-down, 178-179, 296-297 
translator, 311n., 313 
searching, 500-501 
source code, 306 
substantive-based, 503 
threaded, 177 
push-down lists in, 179 
List processing, 296-298 
references, 377-378 
Literal, COBOL, 242-243, 246, 247, 249, 
265, 276 
condition-name, 243 
English, 304 

Local, ALGOL, 216-217, 219, 228 

LOCK option, COBOL, 283 

Logic (see Boolean algebra; Symbolic logic) 

Logical addition, 102, 437, 438, 449 

Boolean matrix, 474 

Logical basis (see Basis) 

Logical comparison, 102-103 

Logical-control computer, 132-137 

Logical dependence and independence, 
472-474 

Logical equality, 102-103 

Logical inequality, 102 

Logical instructions, 102-103, 136 

Honey well 800 computer, 540 
IBM 7090 computer, 544 
UNIVAC 1108A computer, 541 
Logical multiplication, 102, 487, 438, 449- 
450 
Boolean matrices, 474 
Logical operation, COBOL, 247 
(See also Logical operator) 
Logical operations, 437-438, 441-442 
(See also Boolean operations) 
Logical operator, ALGOL, 206 
COBOL, 242 
(See also Logical operation) 


Logical problems, 483-484 
Logical translation of English, 444-446 
references, 494 
Logical value, ALGOL, 206, 209 
in ALGOL Boolean expression, 208-211 
COBOL, 241, 247 
Logistic computer (see Business logistic 
computer) 
Look-up routine, binomial-coefficient, 78 
Loops, in ALGOL, 221 
in COBOL, flow chart, 255 
within loops, 79-80 
in ALGOL, 225 
in COBOL, 254 
(See also Recursion codes; Recursive 
compiler) 
Lowest solution, Boolean matrix equations, 
476-478 


Magnetic tape, 41-43, 105 
in COBOL, 244-246, 257-260, 262, 271- 
274 
rewinding, 283 
Major multiplication, 101 
Make-space pseudoinstruction, 176 
Man-machine simulation, 18-21 
Management decisions, 17 
Mantissa, floating-point word, 99 
Manual searching methods, 504-506 
Marginal-punch cards, 504, 532, 533 
Masked instructions, Honey well 800 com- 
puter, 540 
Mathematical decision theory, 324-333 
references, 378-379 
Mathematical interpretive routine, 164—168 
Mathematical optimization, 324 
(See also Deductive inference) 
Mathematical system, formal, 350 
Matrix, in ALGOL, 226—227 
Boolean (see Boolean matrices; Boolean 
matrix equations) 
in COBOL, 265, 267 
permutation, 476 
pseudopermutation (unitary matrix), 
476-478 
transformation, Boolean, 479-482 
Matrix equation for continued fraction, 
394-395 
(See also Boolean matrix equations) 
Matrix multiplication, in ALGOL, 233 
Medical diagnosis, aids to, 25-26 
references, 379 
factors affecting, 343 
formulation, 334-335 
ingredients, 335 
programming for, 334-348 
reasoning in, 334-335 
Medical knowledge, 335-338 
probabilities in, 341-344 
Melodic rules, 372, 373 
Memory, 27-28, 31, 43-48 
high-speed, in COBOL, 259-260, 268, 270, 
272, 274, 276, 283 
sharing, 179 ; 
reading from, 47-48 
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Memory, required for compiling routine, 189 
speed of, 47 
Memory access time, definition, 47 
Memory address, definition, 31 
Memory segment, COBOL, 276 
Memory-segment list, COBOL, 276 
MEMORY SIZE clause, COBOL, 276-277 
Memory space, sharing, 179 
Memory systems, 44-48 
Merging, ordering by, 515-521 
Metalanguage, 199, 299, 301 
Meta-metalanguage, 199 
Minimization, linear-programming, 326 
Minimum, maximizing, 329 
Minimum-access coding, 96 
Minor multiplication, 101 
MINUS, COBOL, 242-248 
Mixed strategy, 329 
Models, mathematical (see Simulation) 
Modes of operation, 121 
Modification, automatic-programming 
language, 299 
instruction, 32, 77-79, 111 
Mongrel form, 452-453 
MOVE CORRESPONDING statement, 
COBOL, 279 
MOVE statement, COBOL, 278-279, 280- 
282, 285 
Mult-factor, ALGOL, 212 
Mult operators, ALGOL, 212 
Multiple codes, 152-153 
Multiplication, and division, 61-62 
floating, 99-101 
logical, 102, 437, 438, 449. 450 
major, 101 
matrix, in ALGOL, 233 
Boolean, 474 
minor, 101 
Multiplication instructions, types, 101 
Multiplication tables, 61 
MULTIPLY statement, COBOL, 250 
Multiway branch, coding, 112-113 
Music composition by computer, 371-375 


n-phrase, English and Korean, 301-303, 306 
Name (of program author), COBOL, 277 
Napier’s ‘‘bones,’’ 2 
Neg-term, ALGOL, 210 
Negation, 437, 438 
(See also Complement) 
Negative jump, 104 
(See also Conditional jump) 

Nets, self-organizing, 369 
Neural-net simulation, 369 
Neuron, 369-370 
9 (numerical character), COBOL, 263 
NO REWIND option, COBOL, 283 
Nonconditional statement, COBOL, 284 
Noncontiguous working-storage, COBOL, 

270 
Nonlocal, ALGOL, 217, 228 
Normaliging, 00 
Normalising inatruetion, UNIVAC LIO8A 

computer, 642 
NOT, COBOL, 248, 247-248 


not, logical, 437, 438 
Notation, conditional probability, 341 
translator, 303, 312 
(See also Symbols) 
Noun, English and Korean, 301-303, 306 
Number, ALGOL, 206 
ALGOL or COBOL, 198-199 
automatic-programming language, 197 
decimal, ALGOL or COBOL, 198-199 
level, COBOL, 261-263, 265, 268 
Number systems, 54-62 
Numbers, input-output, 6-7, 26, 40 
random, in musical creativity, 329, 372- 
375 types in computers, 66 
NUMERIC class, COBOL, 264 
Numerical analysis, 383-434 
references, 433-434 
Numerical approximations, errors in, 422 
Numerical data, COBOL, standard posi- 
tioning, 280-281 
Numerical solutions to equations, 8-12 
Numerical value, 211, 213-214 


Object, heuristic programming, in abstrac- 
tion, 365 
OBJECT-COMPUTER clause, COBOL, 
276 
Object language, 299n. 
Object phrase, English and Korean, 301 
OBJECT-PROGRAM assignment, COBOL, 
276 
OCCURS clause, COBOL, 265-267, 271 
Octal number system, 55, 56 
addition tables, 61 
conversion, 55-60 
multiplication tables, 61 
OMITTED label-record description option, 
COBOL, 273 
One-address format, 93-94 
One-address instruction system, 93-95 
example, 542-544 
simulating in two-address, 168-172 
One-plus-one instruction address system, 
95-96 
OPEN statement, COBOL, 283, 284 
Operand, relational, COBOL, 247 
Operation, ALGOL, Boolean, truth-value 
definition, 208 
COBOL, logical, 247 
relational, 247, 248 
propositional, 437-438 
(See also Operator) 
Operational characteristics, COBOL, 
description, 263-264 
Operator, ALGOL, adding, 212 
arithmetic, 206, 212 
logical, 206 
mult, 212 
relational, 206, 209 
COBOL, arithmetic, 242 
logical, 242 
relational, 242 
heuristic programming, in abstraction, 
865 
transformation, 352 
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Operator, heuristic programming, in trig- 
onometry, 356-357, 358-359 
(See also Operation) 
Optimality principle, 333 
Optimization, mathematical, 324 
(See also Deductive inference) 
Option-value, COBOL, 245, 264-265 
OR, COBOL, 242, 247-248 
or, inclusive and exclusive, 445 
logical, 437 
Ordering, 510-521 
definition, 498 
methods, comparison, 520-521 
maximum in-pass, 510-511 
minimum in-pass, 510-511, 512 
merging, 515-521 
radix coefficients, 512-515 
transposition, 511-513 
population-based, 503 
references, 534 
substantive-based, 503 
Organization, COBOL, file, 271-272 
over-all, 244-246 
data, bookshelf, 176 
COBOL, 257-263, 271 
for searching, 173 
threaded list, 177 
Oscillators, clock, 36 
Out-scriber, 43 
OUTPUT, COBOL, 283 
Output devices, 30, 41-45 
Output file, COBOL, 257-260, 283 
Output-file area, COBOL, emptying, 283-— 
284 
transfer to, 278-279 
Output format, COBOL, 280-282 
Output numbers, 6-7, 26 
Overflow, 100 
in digital differential analyzer, 128, 129 


Paper tape, punched, 30, 40-41, 105 
Paragraph, COBOL, 2438, 246, 255 
GO TO statement, 251 
Parameter, ALGOL, actual, 229-232 
formal, 230—232 
Parentheses, 187 
in ALGOL, 207-209, 212, 215, 229-231 
translating, 316, 317 
omitting, 209, 212 
Partial addressing, 107 
Partial data-name, COBOL, 279 
Partial-data-name list, COBOL, 279 
Partial differential equations, 414—421 
Pattern recognition, 363-367, 369-371 
references, 381-382 
Payoff, game theory, 329 
Peek-a-boo cards, 504-505 
Perceptron, 369 
PERFORM statement, COBOL, 253-255 
Period, COBOL, 242, 246, 251, 263 
(See also Decimal point) 
Permanent conditions in searching, 508 
Permanent numbers, 66 
Permutation matrix (unitary matrix), 476- 
478 


Phase-selector switch, 121 
Phases, operational, 34 
Philosophy of programming, 54 
Physical characteristics, COBOL, descrip- 
tion, 263-264 
Pictograph, 195, 196 
Pictorial expressions in abstraction, 364-367 
PICTURE clause, COBOL, 263-264, 270, 
281 
Picture-reading machine, 14 
Pitfalls in using computer, 428-429 
PLACES, COBOL, 281 
Plane geometry, 349-351, 352-354 
Planning, heuristic programming, 362 
Players, game theory, 328 
PLUS, COBOL, 242, 248 
Point (see Decimal point) 
POINT LOCATION clause, COBOL, 281 
Polynomial, differences, 395-397 
Polynomial approximation, 393, 398-401 
best fit, 402-404 
differentiation by, 408-409 
integration by, 406-407 
series, 393-394 
solving differential equations by, 411-413 
Polynomial interpolation, 398-401 
Popping up, 178 
Population of patients, 343 
Population-based ordering, 503 
Position, elementary data-item, COBOL, 
280-281 
Post-mortem program debugging, 160-161, 
163-164 
Power positions, definition, 461 
Power series approximation, 393-394 
Precedence rules, ALGOL, arithmetic, 212 
relational and logical, 209 
COBOL, arithmetic, 248 
logical, 248 
Premises in logical problems, 483 
Prime implicant, 454—455 
charts, 457-459 
essential, 458 
obtaining directly, 460-467 
generating, 455-457 
Principle of optimality, 333 
Principles, coding, 53 
Probabilities, conditional, 341 
under conditions, of risk, 327-328 
of uncertainty, 329 
diagnostic, 343 
alternative, 341-344 
in dynamic programming, 330-333 
in medical diagnosis, 341-348 
transition, 330-333 
in treatment, 345-348 
Problem-solving program, general, 355-362 
Procedure, cookbook language, 201, 202 
Procedure body, ALGOL, 230-232 
Procedure declaration, ALGOL, 280-284 
for function, 234-235 
relation to procedure statement, 281— 
232 
COBOL (DEFINE. statement), 266-257 
cookbook language, 201, 202 
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Procedure division, COBOL, 244, 246 
Procedure heading, ALGOL, 230-232 
Procedure identifier, ALGOL, 207-208, 229, 
230 
Procedure-name, COBOL, 243, 246 
Procedure statement, ALGOL, 219, 229— 
230, 232 
as function, 234-235 
relation to procedure declaration, 231— 
232 
COBOL (INCLUDE statement), 254-255, 
256 
cookbook language, 201 
Product, elementary, 451 
transposition of Boolean matrices, 476 
Product-term, ALGOL, 210 
Profile, symptom-disease, 338, 342 
Profit in dynamic programming, 330-333 
Program, automatic, 159 
automonitor, 161-163 
COBOL, 244, 276 
debugging, 53, 160-164 
definition, 7, 53 
general problem-solving, 355-362 
intermediate, 159 
(See also Programming) 
PROGRAM ID clause, COBOL, 277 
Program identification, COBOL, 277 
Programmer, definition, 7, 41 
role, 6-7 
Programming, to achieve intelligence, 295- 
382 
automatic (see Automatic programming) 
business logistic computer, 147-148 
digital differential analyzer, 130-132 
dynamic, 330-333 
in medical diagnosis, 344-348 
heuristic, 348-355 
comparison with translator, 355 
references, 379-381 
technique, 351-354 
linear, 325-326 
philosophy of, 54 
real-time control computer, 133-135 
stages, 53-54, 71-72 
for Tabledex, 505-506 
Proof, geometric, 350 
Propagation of error in polynomial differ- 
ences, 397 
Propositional caleulus, 437 
in medical diagnosis, 335-339 
(See also Symbolic logic) 
Propositional operations, 437-438 
Pseudoinstructions, 298 
Pulses, 35-36 
Punched cards, 30, 41, 43, 105 
(See also Marginal-punch cards; Peek-a- 
boo cards) 
Punched paper tape, 30, 40-41, 105 
Pure strategy, 320 
Push-down list, 178170, 206-297 
translator, lin, $18 
Pushbuttons, 120-129 
Pregees hourlatlo programming eriteria, 
i - 


Quine-McCluskey simplification, 454-459 
Quotation marks, COBOL, 242-243, 246, 
265 


R matrix, 479-480 
Radioactive tracer study, 11, 12 
Radix, 54 
ordering by, 512-515 
Random numbers, in game theory, 329 
in musical creativity, 372-375 
Rate equations, 11 
Reactions, enzyme, analyzing, 484-486 
Read-in, initiating, 122 
Read-in routine, initial, 115-116 
READ statement, COBOL, 283-285 
Read and write instructions, types, 105-106, 
115-116 
Reading from memory, 47-48 
Real-time control computer, 132—137 
external-control instruction, 136 
flow chart, 135 
programming, 133-135 
real type, ALGOL, 226, 230 
Reasoning in medical diagnosis, 334-335 
Receiving-area list, COBOL, 279 
Receiving field, COBOL, 279, 281 
Receiving-field list, COBOL, 279 
Recognition, abstraction (pattern), 363— 
367, 369-371 
references, 381-382 
Record, COBOL, 244, 246 
format, 260-263 
label, 272-275 
processing, 259-260, 282-285 
storing, 257-260, 262, 268, 270, 272 
(See also Data-record) 
Record area, COBOL, 272 
RECORD CONTAINS clause, COBOL, 272 
Record description, COBOL, 246, 260-261 
262-263 
Record-handling statements, COBOL, 283- 
285 ; 
Record-name, COBOL, 272, 285 
Record-name list, COBOL, 272 
Recursion codes, 72-76 
in ALGOL, 221 
(See also Loops) 
Recursive compiler, 190-191 
Recursive definition, 198 
REDEFINES clause, COBOL, 267-268, 271 
Redundant codifying, 523-526 
definition, 522 
Redundant symptoms, 369 
Reel, tape, labeling in COBOL, 272, 274 
References, ALGOL, 238-239 
automatic programming, 192 
automatic-programming translation, 378 
Boolean algebra, foundations and axio- 
matic approach, 493-494 
COBOL, 291-294 
computer applications, 49-52 
computer manuals, 125 
computer-systems control, 157 
decision theory, 378-379 
digital differential analyzer, 156-157 
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References, error detection and correcting 
codes, 535-536 
heuristic programming, 379-381 
information retrieval, 534-535 
instruction formats and operation types, 
124-125 
list processing, 377-378 
logical translation of English, 494 
medical diagnosis, 379 
multiple-cOémponent computing systems, 
157 
numerical analysis, 433-434 
pattern recognition and concept forma- 
tion, 381-382 
simplification of Boolean functions, 494 
sorting and ordering, 534 
symbolic logic, formal theory, 493 
Registers, Honey well 800 computer, 537— 
540, 544 
index, 110 
use, in list processing, 298 
in push-down lists, 178-179 
IBM 7090 computer, 542-544 
instruction, 39 
lights for contents, 120 
manual cleaning, 121 
UNIVAC 1103A computer, 541 
Reinforcement rules, neuron, 370 
Relation, ALGOL, 209 
COBOL, 247, 253 
syntactical, in abstraction, 363-364 
Relational operand, COBOL, 247-248 
Relational operation, COBOL, 247 
(See also Relational operator) 
Relational operator, ALGOL, 206, 209 
COBOL, 242 
(See also Relational operation) 
Relative addressing (see Addressing) 
Relative error, 424-426 
Relaxation of difference equations, 419-421 
Relaxed conditions of searching, 506-510 
Remarks, coding sheet, 65 
REMARKS clause, COBOL, 277 
RENAMES clause, COBOL, 268-270 
Repeat instruction, UNIVAC 1103A com- 
puter, 541 
Replacement, COBOL, 255 
Replacement list, COBOL, 255 
REPLACING, COBOL, 255 
Representations of Boolean expressions, 
hypercube, 446 
lattice diagram, 446-447 
set, 442-444 
Venn diagram, 444, 445 
Reservation system, airline, 23-25 
Residual, difference equation, 420 
digital differential analyzer, 128, 129 
Result, 26 
Retrieval, information, 25, 26 
(See also Data processing; Searching) 
Retrieval computer, 137--140, 149-150 
Return jump instruction, 104, 165 
Rewind options, COBOL, 283 
Rhythm, 371, 373-375 
Ring addition, 102 


Risk, decision under, 325-328 
Round numbers, 423-424 
Round-off errors, 422 
Rounding, 423 
Routine, 53 
(See also Subroutine) 
Rule, Simpson’s, 406-407, 409-410, 413 
trapezoid, 406 
Rules, precedence, ALGOL, arithmetic, 
212 
relational and logical, 209 
COBOL, 248 
logical, 248 
reinforcement, neuron, 370 
selection, musical, 372-373 
significant figure loss, 426-428 
Runge’s method for differential equations, 
413-415 


S (sign), COBOL, 263 
Scaling, 69, 99, 131 
Scope of local variable, ALGOL, 228 
SEAC, 6 
Search, conventional, 173-176 
(See also Searching) 
Search pseudoinstruction, 174 
Searching, definition, 495 
drum, 149, 150 
ingredients, 495 
methods, 500-510 
with computers, 500-503 
manual, 504-506 
Tabledex, 502-503 
with permanent conditions, 508 
with preferential conditions, 508-509 
problems in terms of Boolean algebra, 
499-500 
programming for, 173-176 
for r out of n characteristics, 507-508 
with relaxed conditions, 506-510 
(See also Codifying; Data processing; 
Sorting) 
Searching computer, 137-140, 149-150 
SELECT, COBOL, 275 
Selection, of arguments, 28-30, 33-34, 39 
(See also Addresses; Addressing; 
Memory) 
rules, musical, 372-373 
Selector, in-out, 30, 153 
(See also Interlocks) 
Self-correcting redundant codification, 528= 
526 
Self-organizing nets, 369 
Semantic discussions of language, 194, 100 
200 
Semantic heuristic, 352, 354 
Semantic routine, heuristic programming, 
354 
Semantics, ALGOL, assignment statement 
(see Statement) 
conditional statement, 219-221 
for statement, 221-225 
go to statement, 218-219 
procedure declaration, 281-282 
procedure statement, 220 
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Semantics, ALGOL, simple arithmetic 
expression, modified, for efficient 
translation, 820-321 

for translator, 312 
English-Korean, 301-304 
of language, 199-200 
Semicolon, ALGOL, 205, 216, 230 
COBOL, 242, 246, 251, 261, 284 

Sending area, COBOL, 279 

Sending field, COBOL, 279, 281 

Sense-indicator instructions, IBM 7090 

computer, 544 

Sentence, COBOL, 246 

English and Korean, 301 
Sequences of decisions, 330-333 

in medical diagnosis, 334-335 
Serial computers, 38 
Series approximation, 393-394 

(See also Polynomial approximation) 
Sets, 442-444 

calculus (see Boolean algebra) 
Sexadecimal, 55, 56 

conversion, 55-60 

Sharing high-speed memory, 179 

Sheffer stroke, 442n. 

Shift instructions, IBM 7090 computer, 543 

types, 103 

Sign, COBOL, 263-264 

medical, 335n. 
(See also Symptom) 
(See also Break-point) 

Sign bit, 68-69 

SIGNED, COBOL, 264 

Significant figures, 423-424, 425-429 

Simple arithmetic expression, ALGOL, 211- 

212, 213 
automatic translation, 311-324 
Simple Boolean expression, ALGOL, 209— 
210, 211 
Simple designational expression, ALGOL, 
215 

Simple tally condition, COBOL, 253-254 

Simple variable, ALGOL, 207, 208, 230 

Simplest product of sums, 451, 489-490 

Simplest sum of products, 454 

constrained basis, 470-471 
methods, decimal, 459-460 
Harris-Elbowin, 460-467 
Quine-McCluskey, 454-459 
Simplification of Boolean functions, refer- 
ences, 494 
Simpson's rule for integration, 406-407, 
409-410, 413 
Simulation, 8 
industrial dynamics, 15-19 
man-machine, 18-21 
neural net, 369 
Simulational interpretive routine, 164, 168 
172 

Simultaneous Boolean equations, 483 

Simultaneous linear equations, 884-387 

SIZE clause, COBOL, 208, 265, 270 

Sort, almple, 607 

Sorting, definition, 408 

marginalepunohed oarda, 604, 648, 644 


Sorting, references, 534 
Source code list, 306 
Source language, 299 
Space, memory, sharing, 179 
Spare parts inventory, 140-141 
Special-purpose computers, 126-157 
business and logistics, 140-150 
digital differential analyzer, 127-132 
information retrieval, 137-140, 149-150 
real-time logical control, 132-137 
references, 156—157 
Specification part, ALGOL, 230-231 
Specifier, ALGOL, 230 
Speed, of computer, 28, 36, 43, 47, 179-181 
of information conversion from one 
medium to another, table, 43 
Split tree, 113 
Standard address for return jump, 165 
Standard basis, 449 
STANDARD label-record description 
option, COBOL, 273 
Standard position, elementary data-item, 
COBOL, 280-281 
State, of health, 345-348 
of a system, 325 
alternative, under risk, 326-327 
in dynamic programming, 330-333 
Statement, ADD, COBOL, 249-250 
ALGOL, 215-225, 219, 221, 230 
labeling, 216 
in procedure declaration, 230-231 
types, 215 
ALTER, COBOL, 250-251 
assignment, ALGOL, 203-205, 217-219, 
228 
automatic translation, 311-324 
modified syntax, for efficient trans- 
lation, 320-321 
for translator, 312 
COBOL, class, 249-250 
automatic-programming language, 197 
CLOSE, COBOL, 283, '284 
COBOL, 2438, 246 
compound, ALGOL, 215-217, 219 
COMPUTE, COBOL, 249 
conditional, ALGOL, 219-221 
ALGOL-COBOL differences, 251-252 
COBOL, 251-253 
data-handling, COBOL, 278-279 
DEFINE, COBOL, 256-257 
DIVIDE, COBOL, 250 
else, COBOL, 251-252 
file-handling, COBOL, 282-285 
for, ALGOL, 219, 221-225 
COBOL (see PERFORM, below) 
go to, ALGOL, 214, 218-219 
with conditional statement, 220-221 
in for statement, 225 
in switch declaration, 227-228 
GO TO, COBOL, 250-261, 253 
if, ALGOL, 219 
compound, 249 
COBQL, 261-252 
INCLUDE, COBOL, 254-265, 256 
MOVE, COBOL, 278-979, 280-282, 285 
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Statement, MOVE CORRESPONDING, 
COBOL, 279 
MULTIPLY, COBOL, 250 
nonconditional, COBOL, 284 
OPEN, COBOL, 283, 284 
operational, COBOL, 247-257, 277 
PERFORM, COBOL, 253-254, 255 
procedure, ALGOL, 219, 229-230, 232 
as functions, 234-235 
COBOL (see INCLUDE, above) 
relation to procedure declaration, 
231-232 
READ, COBOL, 283-285 
record-handling, COBOL, 283-285 
SUBTRACT, COBOL, 250 
unconditional, ALGOL, 219, 220 
WRITE, COBOL, 283-285 
Statistics, medical diagnosis, 341-342 
Step-until, ALGOL, 221, 223-224 
Stop instruction, 66, 205n. 
Storage location, definition, 31 
Stored program (see Instruction modifi- 
cation) 
Strategy, game theory, 329 
Stroke, Sheffer, 4427. 
Structures, in ALGOL, 204 
in a language, 199 
Style, musical, 375 
Subblocks, ALGOL, 228 
Subgoals, heuristic programming, 351-352 
Subject, English and Korean, 301 
Sublists, 177, 178 
Subroutine, in ALGOL, 228-229, 231-232 
name, 229, 231 
binomial-coefficient look-up, 78 
in COBOL, 254-257 
column-adding, 79 
data required to use, 81-85 
data supplied to, 84, 165 
definition, 80 
flow chart for setting up, 82 
largest number, 67-68 
in ALGOL, 221, 223 
flow chart, 70 
in one-address, 95 
in symbolic code, 71 
in three-address, 90 
in two-address, 92 
library, 84-85 
missile altitude, 64-66 
flow chart, 69 
in one-address, 94 
in three-address, 89 
in two-address, 91 
missile trajectory, 72-74 
sin 2, 79-80 
square root, 76 
subroutine-modifying, 166-167 
syntax-based translator, 299-301, 306- 
311 
table look-up, 77-78, 117 
translator, 306-311 
in abstraction, 363-365 
verb, COBOL, 256 
(See also Flow chart; Functions) 


Subscript name, COBOL, 253 
Subscript tally condition, COBOL, 253-254 
Subscripted variable, ALGOL, 207 
array declaration, 226 
COBOL, 243, 254, 265-270 
(See also Data-name) 
Substantive-based ordering, 503 
SUBTRACT statement, COBOL, 250 
Subtraction, binary and octal, 61-62 
Subtraction instructions, types, 101 
Successive approximations, 387-393 
Sum, elementary, 452 
Sum-term, ALGOL, 210 
Superimposed irredundant codification, 
526-532, 534 
Switch, ALGOL, 230 
Switch declaration, ALGOL, 227-228, 230 
Switch designator, ALGOL, 227-228 
Switch list, ALGOL, 227 
Switch sensing, one-shot and on-off, 114 
Switches, auxiliary, 104-105, 121 
Syllabary characters, 195, 196 
Symbolic addresses, translating, 182-185 
Symbolic code, 71 
compiling, 182-186 
Honeywell 800 computer, 540-541 
IBM 7090 computer, 543-544 
instructions for translator, 311 
translating, 189 
UNIVAC 1103A computer, 541-542 
Symbolic coding, 70-71 
Symbolic logic, 436 
combined elements, 438 
equivalence, 439 
implication, 439 
tautology, 439-440 
elementary elements, 438 
formal theory, references, 493 
in medical diagnosis, 335-339 
propositional operations, 437-438 
propositions, 436-437 
truth tables, 447-448 
Symbols, basic, Boolean algebra, 448n. 
COBOL, arithmetic, 242, 248 
editing, 242 
flow-chart, 69, 74 
key, translator, 307, 310 
logical operation, 102-103, 438 
ALGOL, 206 
COBOL, 241-242, 247-248 
matrix operation, 474-475 
primitive, 350 
syntactical-definition, 199 
translator, 306 
(See also Notation) 
Symmetry in heuristic programming, 868 
Symptom, medical, 335 
ease of obtaining, 339 
independent, 344 
irrelevant, 369 
redundant, 369 
typical, 368-369 
Symptom complex, 337 
probability, 343 
Sy mptom-disease complex, 3387 
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Symptom-disease profile, 848 
Symptom profile, 838, $42 
Synchronization, 36 
Syntactic heuristic, $52, 455 
Syntactic symmetry, heuristic program- 
ming, 353 
Syntactical analysis, 205 
in abstraction, 863-365 
ALGOL, simple arithmetic expression, 
213 
simple Boolean expression, 210 
(See also Syntax) 
Syntactical definitions of language, 194, 
197-199 
symbols, 199 
Syntactical relations, in abstraction, 363— 
364 
Syntax, automatic-programming language, 
197-198 
English-Korean, 301-304 
house concept, 364 
procedure declaration, ALGOL, 230 
simple arithmetic expression, ALGOL, 
modified, for efficient transla- 
tion, 320-321 
for translator, 312 
statements (see Statement) 
translator based on, 298-324 
USAGE clause, COBOL, 264 
(See also Syntactical analysis) 
Syntax routine, heuristic programming, 
354-355 


T, COBOL, 241 
(See also Logical value) 
Table, idealized concept, 367 
Table look-up, 77-78, 116-118 
Table reference, COBOL, 244 
differentiation by, 408 
Tabledex, 502-503 
manual, computer generation, 505-506 
Tables, in COBOL, 243-244, 265-267 
of constants, 271, 274 
truth, 447-448 
Tally, 74 
Tally condition, ALGOL, 221-223 
COBOL, simple, 253-254 
subscript, 253-254 
Tape (see Magnetic tape; Paper tape) 
Target fragments, 306 
Target language, 299 
instructions for translator, 311 
Tautology, 439-440 
Taylor series, 402, 412, 413 
Temporaries, 66 
Term, ALGOL, arithmetic, 212, 214 
Boolean, 209-211 
Terminology, computer, 31-32 
Test, medical, 835n, 
(See also Symptom) 
THEN, COBOL, 261 
Theorema, geometric, 860-855 
trigonometric, HO6-461 
Theories in logioal probleme, 484 
Theta product of Boolean matrioos, 404 


Threaded lists, 177 
push-down lists in, 179 
Three-address instruction format, 88 
Three-address instruction system, 88—90 
example, 537, 540 
Threshold, neuron, 369-370 
THRU, COBOL, 276 
TIMES, COBOL, 242, 248, 253, 265 
TO, COBOL, 249-250, 279 
TO PROCEED TO, COBOL, 251 
Toymaker’s problem, 330-333 
Transducers, 15 
Transfer instruction, 91-93, 167, 170 
Honeywell 800 computer, 540 
Transformation, Boolean algebraic, 479- 
482 
Transformation operator, heuristic pro- 
gramming, 352 
Transformation types, heuristic program- 
ming, 355-357 
Transition probability, 330-333 
in treatment, 345-348 
Transition value, dynamic programming, 
330-333 
Translating computing unit, 151 
Translation, automatic-programming- 
language, 298-324 
English-Korean, 301-305 
human language, 302 
logical, of English, 444-446 
references, 494 
Translator, syntax-based, 298-324 
in abstraction, 363-365 
comparison with heuristic program, 
355 
Translator compiler, 182-185 
Transpose of Boolean matrix, 474 
Transposition, ordering by, 511-513 
Trapezoidal rule for integration, 406 
ALGOL code, 233 
Treatment decisions, alternative, 325-330 
sequences, 330-333 | 
Tree, split, 113 
Trigonometric theorems, 355-361 
True (see Logical value) 
Truncated errors, 421-422 
Truth tables, 447-448 
Two-address instruction format, 90 
Two-address instruction system, 90-93 
example, 541-542 
simulating one-address in, 168-172 
Type, ALGOL, 226, 227, 230-231, 234 
COBOL item, 261 
(See also CLASS) 
heuristic, 355-357 
label-record, COBOL, 271-273, 274 
searching on, 174-175 
threaded-list, 177 
transformation, heuristic programming, 
355-357 
usage, COBOL, 264 


Type or branch link, list, 297 

Type declaration, ALGOL, 226-228 
Typoeface chain, 45 

Typeeface drum, 44 


ee 
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Uncertainty, decision under, 325, 328-330 

Unconditional statement, ALGOL, 219, 220 

Undetermined coefficients, function evalua- 
tion by, 409-410 

Union of sets, 443 

Unitary matrix equations, 476-478 

UNIVAC 1103A scientific computer, 541— 
542 

UNIVAC X-3 code, 96 

Unnormalized, 99 

Unsigned integer, ALGOL or COBOL, 198— 
199, 215 

Unsigned number, ALGOL, 212 

(See also Decimal number) 

until, ALGOL, 221, 223-224 

UNTIL, COBOL, 253 

USAGE, clause, COBOL, 264, 270 

Usage type, COBOL, 264 


Value, absolute, in ALGOL, 234 
alternate-treat ment-decision, under con- 
ditions, of certainty, 325-326 
of risk, 327-328 
of uncertainty, 328, 329 
condition-name, COBOL, 264-265 
(See also VALUE clause) 
condition-variable, COBOL, 265 
data-name, COBOL, 242, 256 
initial, 270 
decision-state pair, 325 
expression, automatic-programming 
language, 197 
logical, ALGOL, 206, 209 
in Boolean expression, 208-211 
COBOL, 241, 247 
numerical, in ALGOL arithmetic expres- 
sion, 211, 213-214 
subscripted variable, COBOL, 266, 267 
transition, 330-333 
threaded-list type, 177 
VALUE clause, COBOL, 264-265, 270-271 
Value-literal, COBOL, 265 
VALUE OF clause, COBOL, 273-274 
Value part, ALGOL, 230-232 
Variable, ALGOL, 206-207, 208, 209, 212, 
215, 217, 221, 226, 227, 229 
automatic-programming language, 197 
Boolean algebra, 448n. 
transformation, 479-482 
COBOL (see Data-name) 
controlled, ALGOL, 225 
dependent and independent, ALGOL, 
229-231, 235 
COBOL, 255-256 
independent, ALGOL, 208 
local and nonlocal, ALGOL, 228 


Variable, logical, ALGOL, 209 
simple, ALGOL, 207, 208, 230 
subscripted, ALGOL, 207 

in array declaration, 226 
COBOL, 254, 265-270 

Variable connector, 81 
in ALGOL, 227-228 

Variable list, ALGOL, 226, 230 

Variable-word addressing, 106-108 

VARYING, COBOL, 253 

Vector in ALGOL, 226 

Venn diagram, 444, 445 

Verb, COBOL, 243, 249 

(See also Statement) 
English and Korean, 301 

Verb-name, COBOL, 256 

Verb structure, COBOL, 256 

Verb subroutine, COBOL, 256 


Well-formed formulas, in geometry, 350 
in heuristic programming, 357 
while, ALGOL, 221, 224-225 
Whirlwind, 6 
WITH, COBOL, 283 
Word, automatic-programming language, 
195-197 
COBOL, 242-244 
kinds, 242 
as literals, 247 
computer, 31 
end-of-data, 176, 177 
form, 35-36 
inserting single, 121 
interpretation, 32, 35 
labeling, 173 
length, 35 
special read-only, 113 
Word block, 105 
Word format, 39, 62-66, 88, 90, 95-96 
one-address, 94 
IBM 7090 computer, 542 
simulated, 169 
three-address, Honey well 800 computer, 
538 
two-address, 92 
UNIVAC 1103A computer, 541 
Working-storage area, COBOL, 260, 268, 
270 
transfer to or from, 278-279 
WRITE statement, COBOL, 283-285 
Writing a code, steps in, 53-54, 71-72 


X (alphanumerical character), COBOL, 2638 
X-ray diffraction, 10 


ZERO SUPPRESS clause, COBOL, 282 





